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TITLE 

MAIZE GLUTATHIONE-S-TRANSFERASE ENZYMES 
FIELD OF THE INVENTION 
This invention is in the field of plant molecular biology. More 
5 specifically, this invention pertains to nucleic acid firagments encoding maize 
glutathione-S-transferase (GST) enzymes involved in the detoxification of 
xenobiotic compounds in plants and seeds. 

BACKGROUND OF THE INVENTION 
Glutathione-S-transferases (GST) are a family of enzymes v/bich catalyze 
10 the conjugation of glutathione, homoglutathione QiGSH) and other ghitathione- 
like analogs via a sulfhydryl group, to a large range of hydrophobic, electrophilic 
compotmds. The conjugation can result in detoxification of tiiese compounds. 
GST enzymes have been identified in a range of plants including maize (Wosnick 
et al.. Gene (Amst) 76 (1) (1989) 153-160; Rossini et al., Plant Physiology 
15 (Rockville) 1 12 (4) (1996) 1595-1600; Holt ct al., Planta (Heidelberg) 196 (2) 

(1995) 295-302), wheat (Edwards et al.. Pestle. Blochem. Physiol. (1996) 54(2), 
96-104), sorghum (Hatzios et al., J. Environ. Sci. Health, Part B (1996), B31(3), 
545-553), arabidopsis (Van Der Kop et al.. Plant Molecular Biology 30 (4) 

(1996) , sugarcane (Singhal et al., Phytochemistry (OXF) 30 (5) (1991) 

20 1409-1414), soybean (Flury et al., Physiologia Plantarum 94 (1995) 594-604) and 
peas (Edwards R., Physiologia Plantarum 98 (3) (1996) 594-604). GST's can 
con^rise a significant portion of total plant protein, for example attaining from 1 
to 2% of the total soluble protein in etiolated maize seedlings (Timmennann, 
Physiol Plant. (1989) 77(3), 465-71). 

25 Glutathione S-transferases (GSTs; EC 2.5 . 1 . 18) catalyze the nucleophilic 

attack of the thiol group of GSH to various electrophilic substrates. Then: 
functions and regulation in plants has been recently reviewed (Marts et al., Annu 
Rev Plant Physiol Plant Mol Biol 47:127-58 (1996); Droog, F. J Plant Growth 
Regul 16:95-107, (1997)). They are present at every stage of plant development 

30 from early embryogenesis to senescence and in every tissue class examined. The 
agents that have been shown to cause an increase in GST levels have the 
potential to cause oxidative destruction in plants, suggesting a role for GSTs in 
the protection from oxidative damage. In addition to their role in the protection 
from oxidative damage, GSTs have the ability to nonenzymatically bind certain 

35 small molecules, such as auxin (ZetU et al. , PNAS 91 :689-693, (1994)) and 
perhaps regulate their bioavailability. Furthermore the addition of GSH to a 
molecule serves as an "address" to send that molecule to the plant vacuole 
(Marrs et al.. Nature 375:397-400, (1995)). 
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GSTs have also been implicated in the detoxification of certain 
herbicides. Maize GSTs have been well characterized in relation to herbicide 
metabolism. Three genes from maize have been cloned: GST29 (Shahetal., 
Plant Mol Biol 6, 203-21 1(1986)), GST 27 (Jepson et al., Plant Mol Biol 

5 26:1 855-1 866, (1994)), GST 26 (Moore et al.. Nucleic Acids Res 14:7227-7235 
(1986)). These gene products form four GST isoforms: GST I (a homodimer of 
GST 29), GST H (a heterodimer of GST 29 and GST 27), GST ffl (a homodimer 
of GST 26), and GST IV (a homodimer of GST 27). GST 27 is highly inducible 
by safener compounds (Jepson (1994) supra; Holt et al., Planta 196:295-302, 

10 (1995)) and overexpression of GST 27 in tobacco confers alachlor resistance to 
transgenic tobacco (Jepson, personal communication). Additionally, Bridges et 
al. (U.S. 5589614) disclose the sequence of a maize derived GST isoform n 
promoter useful for the expression of foreign genes in maize and wheat. In 
soybean, herbicide compounds conjugated to hGSH have been detected and 

15 correlated with herbicide selectivity (Ftear et al., Physiol 20: 299-3 10 (1983); 
Brown et al.. Pest Biochem Physiol 29:1 12-120, (1987)). This implies that 
hGSH conjugation is an inq>ortant determinant in soybean herbicide selectivity 
although this hypothesis has not been diaracterized on a molecular level. 

Some efforts have been made to alter plant phenotypes by the expression 

20 of either plant or nuunmalian foreign GST genes or their promoters in mature 
plant tissue. For example, Hehner et al. (U.S. 5073677) teach the expression of a 
rat GST gene in tobacco under the control of a strong plant promoter. Similarly, 
Jepson et al. (WO 97/1 1 1 89) disclose a chemically inducible maize GST promoter 
useftd for the expression of foreign proteins in plants; Chilton et al. (EP 256223) 

25 discuss the construction of herbicide tolerant plants expressing a foreign plant 
GST gene; and Bieseler et al. (WO 96/23072) teach DNA encoding GSTIIIc, its 
recombinant production and transgenic plants containing the DNA having a 
herbicide-tolerantphenotype. 

Manipulation of nucleic acid fragments encoding soybean GST to use in 

30 screening in assays, the creation of herbicide-tolerant transgenic plants, and 

altered production of GST enzymes depend on the heretofore unrealized isolation 
of nucleic acid fragments that encode all or a substantial portion of a soybean GST 
enzyme. 

SUMMARY OF THE INVENTION 
35 The present invention provides nucleic acid fragments isolated from maize 

encoding all or a substantial portion of a GST enzyme. The isolated nucleic acid 
fragment is selected from the group consisting of (a) an isolated nucleic acid 
fragment encoding all or a substantial portion of the amino acid sequence selected 
from the group consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ 

2 
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ID N0:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24; (b) an isolated 
nucleic acid fragment that is substantially similar to an isolated nucleic acid 
fragment encoding all or a substantial portion of the amino acid sequence 

5 sequence selected from the group consisting of SEQ ID N0:2, SEQ ID N0:4, 
SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID N0:12, SEQ ID N0:14, 
SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID N0:22, and SEQ ID 
NO:24; and (c) an isolated nucleic acid fragment that is complementary to (a) or 
(b). The nucleic acid fragments and corresponding polypeptides are contained in 

10 the accompanying Sequence Listing and described in the Brief Description of the 
Invention. 

In another embodiment, the instant invention relates to chimeric genes 
encoding maize GST enzymes or to chimeric genes that comprise nucleic acid 
fragments as described above, the chimeric genes operably linked, to suitable 

15 regulatory sequences, wherein expression of the chimeric genes results in altered 
levels of the encoded enzymes in transformed host cells. 

The present invention further provides a transformed host cell comprising 
the above described chimeric gene. The transformed host cells can be of 
eukaryotic or prokaryotic origin. The Invention also includes transformed plants 

20 that arise from transformed host cells of higher plants, and from seeds derived 
from such transforaied plants, and subsequent progeny. 

Additionally, the invention provides methods of altering the level of 
expression of a maize GST enzyme in a host ceil comprismg the steps of; 
(i) transforming a host cell with the above described chuneric gene and; 

25 (ii) growing the transformed host cell produced in step (i) under conditions that 
are suitable for expression of the chimeric gene wherein expression of the 
chimeric gene results in production of altered levels of a plant GST enzyme in the 
transformed host cell relative to expression levels of an untransformed host cell. 
In an altemate embodiment, the present invention provides methods of 

30 obtaining a nucleic acid fragment encoding all or substantially all of the amino 
acid sequence encodmg a maize GST enzyme comprising either hybridization or 
primer-directed amplification methods known in the art and using the above 
described nucleic acid fragment. A primer-amplification-based method uses SEQ 
IDNOS.: 1,3,5,7,9, 11, 13, 15, 17, 19, 21, or 23. The product of these methods 

35 is also part of the invention. 

Another embodiment of the invention includes a method for identifying a 
compound that inhibits the activity of a maize GST enzyme encoded by the 
nucleic acid fi^gment and substantially similar and complementary nucleic acid 
fragments of SEQ IDNOS.: 1-24. The method has the steps: (a) transforming a 
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host cell with the above described chimeric gene; (b) growing the transfonned 
host cell under conditions that are suitable for expression of the chimeric gene 
wherein expression of the chimeric gene results in production of the GST enzyme; 
(c) optionally purifying the GST enzyme expressed by the transformed host cell; 

5 (d) contacting the GST enzyme with a chemical compound of interest; and 
(e) identifying the chemical compound of interest that reduces the activity of the 
maize GST enzyme relative to the activity of the maize GST enzyme in the 
absence of the chemical compound of interest. 

This method may further include conducting step (d) in the presence of at 

10 least one electrophilic substrate and at least one thiol donor. The isolated nucleic 
acid fragments of this method are chosen from the group represented by SEQ ID 
NOS.: 1, 3, 5. 7, 9. 11, 13, 15, 17, 19, 21, and 23, and the maize GST enzyme is 
selected from the group consisting of SEQ ID NOS.: 2, 4, 6, 8,10, 12, 14, 16, 18, 
20, 22, and 24. 

15 The invention further provides a method for identifying a chemical 

compound that inhibits the activity of the maize GST enzyme as described herein, 
• wherein the identification is based on a comparison of the phenotype of a plant 
transformed with the above described chimeric gene contacted with the inhibitor 
candidate with the phenotype of a transfonned plant that is not contacted with the 

20 inhibitor candidate. The isolated nucleic acid fragment of this method is selected 
from the group consisting of SEQ ID NOS.: 1,3,5,7,9, 11,13, 15, 17. 19.21, 
and 23 and the maize GST enzyme is selected from the group consisting of SEQ 
ID NOS.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. 

In another embodiment, the invention provides a method for identifying a 

25 substrate for the nxaize GST enzyme. The method comprises the steps of: 
(a) transforming a host cell vwth a chimeric gene comprising the nucleic acid 
fragment as described herein, the chimeric gene encoding a maize GST enzyme 
operably linked to at least one suitable regulatory sequence; (b) grovwng the 
transformed host cell of step (a) under conditions that are suitable for expression 

30 of the chimeric gene resulting in production of the GST enzyme; (c) optionally 
purifying the GST enzyme expressed by the transformed host cell; (d) contacting 
the GST enzyme with a substrate candidate; and (e) comparing the activity of 
maize GST enzyme with the activity of maize GST enzyme that has been 
contacted with the substrate candidate and selecting substrate candidates that 

35 increase the activity of the maize GST enzyme relative to the activity of maize 
GST enzyme in the absence of the substrate candidate. More preferably, step (d) 
of this method is carried out in the presence of at least one thiol donor. The 
isolated nucleic acid fragment of this method is selected from the group consisting 
of SEQ ID NOS.: 1, 3, 5, 7, 9. 1 1, 13, 15, 17, 19, 21, and 23. 25 and the maize 

4 
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GST enzyme is selected from the group consisting of SEQ ID NOS.: 2, 4, 6, 8, 
10, 12, 14,16,18,20, 22, and 24. 

Alternatively, methods are provided for identifying a maize GST substrate 
candidate wherein: the identification of the substrate candidate is based on a 
5 comparison of the phenotype of a host cell transformed with a chimeric gene 

expressing a maize GST enzyme and contacted with a substrate candidate with the 
phenotype of a similarly transformed host cell grown without contact with a 
substrate candidate. 

The isolated nucleic acid fragment of this method is selected from the 
10 group consisting of SEQ ID NOS.: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, and 23 and 
the maize GST enzyme is selected from the group consisting of SEQ ID NOS.: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, and 24. 

BRIEF DESCRIPTION OF SEQUENCE DESCRIPTIONS 
AND BIOLOGICAL DEPOSITS 
1 5 The invention can be more fiiUy understood from the following detailed 

description and the accompanying sequence descriptions and biological deposits 
which form a part of this application. 

The following sequence descriptions and sequences listings attached 
hereto comply with the rules gbvemmg nucleotide and/or amino acid sequence 
20 disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825, The 
Sequence Descriptions contain the one letter code for nucleotide sequence 
characters and the three letter codes for amino acids as defined in conformity with 
the lUPAC-IYUB standards d^sonhed in Nucleic Acids Research 75:3021-3030 
(1985) and in the BiochemicalJourml 219 (No. 2;:345-373 (1984) which are 
25 herein incorporated by reference. 

SEQ ID N0:1 is the nucleotide sequence comprising the cDNA msert in 
clone bmsl.pk0023.g8 encoding a maize GST. 

SEQ ID N0:2 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone bmsl..pk0023.g8. 
30 SEQ ID N0:3 is the nucleotide sequence comprising the cDNA insert in 

clone cs.pkOO 1 0x5 encoding a maize GST. 

SEQ ID N0:4 is the deduced ammo acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone cs.pk0010.c5. 

SEQ ID N0:5 is the nucleotide sequence comprising the cDNA insert in 
35 clone cebl .pkOOl 7.a5 encoding a maize GST. 

SEQ ID N0:6 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone cebl.pk0017.a5. 

SEQ ID NO: 7 is the nucleotide sequence comprising the cDNA insert in 
clone cc71se-a.pk0001.g2 encoding a maize class III GST. 
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SEQ ID N0:8 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone cc71se-a.pk0001,g2. 

SEQ ID N0:9 is the nucleotide sequence comprising the cDNA insert in 
clone cc71se-b.pk0014.b8 encoding a maize class III GST. 
5 SEQ ID NO: 1 0 is the deduced amino acid sequence of the nucleotide 

sequence comprising the cDNA insert in clone cc71se-b.pk0014.b8. 

SEQ ID NO: 1 1 is the nucleotide sequence comprising the cDNA insert in 
clone ceb5.pk005 1 .f8 encoding a maize class III GST. 

SEQ ID NO: 1 2 is the deduced amino acid sequence of the nucleotide 
10 sequence comprising the cDNA insert in clone ceb5.pk005 1 M. 

SEQ ID NO: 13 is the nucleotide sequence comprising the cDNA insert in 
clone crln.pk0003.bl encoding a maize class III GST. 

SEQ ID NO: 14 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone crln.pk0003,bl . 
15 SEQ ID NO: 1 5 is the nucleotide sequence comprising the cDNA insert in 

clone crln.pk0014.g8 encoding a maize class III GST. 

SEQ ID NO: 1 6 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone crln.pk0014.g8. 

SEQ ID NO: 1 7 is the nucleotide sequence comprising the cDNA insert in 
20 clone m.l5.5.d06.sk20 encoding a maize class II GST. 

SEQ ID NO: 1 8 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert in clone m.l5.5.d06.sk20. 

SEQ ID NO: 1 9 is the nucleotide sequence comprising the cDNA insert in 
clone crln.pkOO4O.ei2 encoding a maize class II GST. 
25 SEQ ID NO:20 is the deduced amino acid sequence of the nucleotide 

sequence comprising the cDNA insert in clone crln.pk0040.el2. 

SEQ ID N0:21 is the nucleotide sequence comprising the cDNA insert in 
clone ceb5.pk0049.al 1 encoding a maize class III GST. 

SEQ ID NO:22 is the deduced amino acid sequence of the nucleotide 
30 sequence comprising the cDNA insert in clone ceb5.pk0049.al 1 . 

SEQ ID NO:23 is the nucleotide sequence comprising the cDNA insert in 
clone csLpk0059.e2 encoding a maize class III GST. 

SEQ ID NO:24 is the deduced amino acid sequence of the nucleotide 
sequence comprising the cDNA insert m clone csl.pk0059.e2. 
35 The transformed £. coli ceb5.pk0051.f8/pET30(LIC)BL21(DE3) 

containing the gene ceb5.pk005 1 .f8 in a pET30(LIC) vector encoding a maize 
class III GST was deposited on 21 August 1997 with the American Type Culture 
Collection (ATCC), 12301 Pariclawn Drive, Rockville, MY 20852, U.S.A., under 
the terms of the Budapest Treaty on the International Recognition of the Deposit 

6 
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of Micro-organisms for the Purpose of Patent Procedure. The deposit is 
designated as ATCC 985 1 1. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides novel GST nucleotide sequences and 

5 encoded proteins isolated from maize. GST enzymes are known to function in 
the process of detoxification of a variety of xenobiotic compounds in plants, 
most notably, herbicides. Nucleic acid fragments encoding. at least a portion of 
several maize GST enzymes have been isolated and identified by comparison of 
random plant cDNA sequences to public databases containing nucleotide and 

10 protein sequences using the BLAST algorithms \vell known to those skilled in 
the art. The sequences of the present invention are useful in the construction of 
herbicide-tolerant transgenic plants, in the recombinant production of GST 
enzymes, in the development of screening assays to identify compounds mhibitory 
to the GST enzymes, and in screening assays to identify chemical substrates of the 

15 GSTs. 

In the context of this disclosure, a number of terms shall be utilized. 
As use herein "Glutathione S-Transferase" or "GST" refers to any plant 
derived glutathione S-transferase (GST) enzyme capable of catalyzmg the 
conjugation of glutathione, homoglutathione and other glutathione-like analogs 

20 via a sulfhydryl group, to hydrophobic and electropliilic compoxmds. The term 
GST includes amino acid sequences longer or shorter than the length of natural 
GSTs, such as functional hybrid or partial fragments of GSTs, or thek analogues. 
As used herein "GST" is not intended to be delimited on the basis of enzyme 
activity but may encompass amino acid sequences that possess no measurable 

25 enzyme activity but are substantially similar in to those sequences, known in the 
art to possess the above mentioned glutathione conjugating activity. 

The term '^class" or "GST class** refers to a grouping of the various GST 
enzymes according to amino acid identity. Currently, four classes have been 
identified and are referred to as "GST class V "GST class H", "GST class ffl*' 

30 and "GST class IV" . The grouping of plant GSTs into three classes is described 
by Dvoogeidl. {Plant Physiology 107:1139-1146(1995)). All available amino 
acid sequences were aligned using the Wisconsin Genetics Computer Group 
package (Wisconsin Package Version 9.0, Genetics Computer Group (GCG), 
Madison, WI), and graphically represented on a phylogenetic tree. Three groups 

35 were identified: class one including the archetypical sequences from maize GST I 
(X06755) and GST III (X04375); class two including the archetypical sequence 
from Dianthus caryophyllus (M64628); and class three mcluding the archetypical 
sequence soybean GH2/4 (M20363). Recently, Applicants have established a 



7 
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further subgroup of the plant GSTs known as class IV GSTs with its archetypical 
sequence being In2-1 (X58573). 

As used herein, an "isolated nucleic acid fragment" is a polymer of RNA 
or DNA that is single- or double-stranded, optionally containing synthetic, non- 

5 natural or altered nucleotide bases. An isolated nucleic acid fragment in the form 
of a polymer of DNA may be comprised of one or more segments of cDNA, 
genomic DNA or synthetic DNA, 

As used herein, "substantially similar" refers to nucleic acid fragments 
wherein changes in one or more nucleotide bases results in substitution of one or 

10 more amino acids, but do not affect the functional properties of the protein 

encoded by the DNA sequence. "Substantially similar" also refers to nucleic acid 
fragments wherein changes in one or more nucleotide bases does not affect the 
ability of the nucleic acid fragment to mediate alteration of gene expression by 
antisense or co-suppression technology. "Substantially similar^ ' also refers to 

15 modifications of the nucleic acid fr^ments of the instant invention such as 
deletion or insertion of one or more nucleotide bases that do not substantially 
. affect the functional properties of the resultmg transcript vis-i-vis the ability to 
mediate alteration of gene expression by antisense or co-suppression technology 
or alteration of the functional properties of the resulting protein molecule. It is 

20 therefore understood that the invention encompasses more than the specific 
exemplary sequences. 

For example, it is well known in the art that antisense suppression and co- 
suppression of gene expression may be accomplished using nucleic acid fragments 
representing less that the entire coding region of a gene, and by nucleic acid 

25 fragments that do not share 1 00% identity with the gene to be suppressed. 
Moreover, alterations in a gene which result in the production of a chemically 
equivalent amino acid at a given site, but do not effect the functional properties of 
the encoded protein, are well known in the art. Thus, a codon for the amino acid 
alanine, a hydrophobic amino acid, may be substituted by a codon encoding 

30 another less hydrophobic residue (such as glycine) or a more hydrophobic residue 
(such as valine, leucine, or isoleucine). Similarly, changes which result in 
substitution of one negatively charged residue for another (such as aspartic acid 
for glutamic acid) or one positively charged residue for another (such as lysine for 
arginine) can also be expected to produce a functionally equivalent product 

35 Nucleotide changes which result in alteration of the N-terminal and C-terminal 
portions of the protein molecule would also not be expected to alter the activity of 
the protein. Each of the proposed modifications is well within the routine skill in 
the art, as is deteraiination of retention of biological activity of the encoded 
products. Moreover, the skilled artisan recognizes that substantially similar 

8 
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sequences encompassed by this invention are also defined by their ability to 
hybridize, under stringent conditions (O.IX SSC, 0.1% SDS, 65 "C), with the 
sequences exemplified herein. Preferred substantially similar nucleic acid 
fiagments of the instant invention are those nucleic acid fi-agments whose DNA 

5 sequences are at least 80% identical to the DNA sequence of the nucleic acid 

fragments reported herein. More preferred nucleic acid firagments are at least 90% 
identical to the identical to the DNA sequence of the nucleic acid fiagments 
reported herein. Most prefened are nucleic acid fiagments that are at least 95% 
identical to the DNA sequence of the nucleic acid fiagments reported herein. 

10 A "substantial portion" of an amino acid or nucleotide sequence 

comprising enough of the amino acid sequence of a polypeptide or the nucleotide 
sequence of a gene to putatively identify that polypeptide or gene, either by 
manual evaluation of the sequence by one skilled m the art, or by computer- 
automated sequence comparison and identification using algorithms such as 

15 BLAST (Basic Local Aliment Search Tool; Altschul, S. F., et al., (1993) J. Mol 
Biol. 215:403-410; see also www.ncbi.ntaLnih.gov/BLAST/). In general, a 
sequence often or more contiguous ammo acids or thirty or more nucleotides is 
necessary in order to putatively identify a polypeptide or nucleic acid sequence as 
homologous to a known protein or gene. Moreover, with respect to nucleotide 

20 sequences, gene specific oligonucleotide i»obes comprising 20-30 contiguous 
nucleotides may be used in sequence-dependent methods of gene identification 
(e.g.. Southern hybridization) and isolation (e.g., in situ hybridization of bacterial 
colonies or bacteriophage plaques). In addition, short oligonucleotides of 
12-15 bases may be used as amplification primers in PGR in order to obtain a 

25 particular nucleic acid firj^ent comprismg the primers. Accordingly, a 

"substantial portion" of a nucleotide sequence comprises enough of the sequence 
to specifically identify and/or isolate a nucleic acid firagment comprising the 
sequence. The instant specification teaches partial or complete amino acid and 
nucleotide sequences encoding one or more particular fungal proteins. The skilled 

30 artisan, having the benefit of the sequences as reported herein, may now use all or 
a substantial portion of the disclosed sequences for purposes known to those 
skilled in this art. Accordingly, the instant invention comprises the complete 
sequences as reported in the accompanying Sequence Listing, as well as 
substantial portions of those sequences as defined above. 

35 The term "complementary" is used to describe the relationship between 

nucleotide bases that are capable to hybridizing to one another. For example, with 
respect to DNA, adenosine is complementary to thymine and cytosine is 
complementary to guanine. Accordingly, the instant invention also includes 
isolated nucleic acid fragments that are complementary to the complete sequences 
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as reported in the accompanying Sequence Listing as well as those substantially 
similar nucleic acid sequences. 

"Codon degeneracy" refers to divergence in the genetic code permitting 
variation of the nucleotide sequence without effecting the amino acid sequence of 
5 an encoded polypeptide. Accordingly, the instant mvention relates to any nucleic 
acid fragment that encodes all or a substantial portion of the amino acid sequence 
encoding the GST enzymes as set forth m SEQ ID Nos: SEQ ID N0:2, SEQ ID 
N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID N0:12, SEQ ID 
N0:14, SEQ ID NO: 16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22, and 
10 SEQ ID NO:24. The skilled artisan is well aware of the "codon-bias" exhibited by 
a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a gene for improved expression in a host cell, it is 
desirable to design the gene such that its frequency of codon usage approaches the 
frequency of preferred codon usage of the host cell. 
15 "Synthetic genes" can be assembled from oligonucleotide building blocks 

that are chemically synthesized using procedures known to those skilled in the art. 
These building blocks are ligated and annealed to forai gene segments which are 
then enzymatically assembled to construct the entire gene. "Chemically 
synthesized", as related to a sequence of DNA, means that the component 
20 nucleotides were assembled in vitro. Manual chemical synthesis of DNA may be 
accomplished using well established.procedures, or automated chemical synthesis 
can be performed using one of a number of commercially available machines. 
Accordingly, the genes can be tailored for optimal gene expression based on 
optimization of nucleotide sequence to reflect the codon bias of the host cell. The 
25 skilled artisan appreciates the likelihood of successful gene expression if codon 
usage is biased towards those codons favored by the host. Determination of 
preferred codons can be based on a survey of genes derived from the host cell 
where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 
30 including regulatory sequences preceding (5* non-coding sequences) and 

following (3' non-coding sequences) the coding sequence. 'Native gene" refers to 
a gene as found in nature with its own regulatory sequences. "Chimeric gene" 
refers any gene that is not a native gene, comprising regulatory and coding 
sequences that are not found together in nature. Accordingly, a chimeric gene 
35 may comprise regulatory sequences and coding sequences that are derived from 
different sources, or regulatory sequences and codmg sequences derived from the 
same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of 
an organism. A "foreign" gene refers to a gene not normally found in the host 

10 
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organism, but that is introduced into the host organism by gene transfer. Foreign 
genes can comprise native genes inserted into a non-native organism, or chimeric 
genes. A "transgene" is a gene that has been introduced into the genome by a 
transformation procedure. 
5 "Coding sequence" refers to a DNA sequence that codes for a specific 

amino acid sequence. "Suitable regulatory sequences" refer to nucleotide 
sequences located upstream (5' non-coding sequences), within, or downstream 
(3* non-coding sequences) of a coding sequence, and which influence the 
transcription, RNA processing or stability, or translation of the associated coding 
10 sequence. Regulatory sequences may include promoters, translation leader 
sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a DNA sequence capable of controlling the 
expression of a coding sequence or functional RNA. In general, a coding 
sequence is located 3' to a promoter sequence. The promoter sequence consists of 
15 proximal and more distal upstream elements, the latter elements often referred to 
as enhancers. Accordingly, an "enhancer" is a DNA sequence which can 
stimulate promoter activity and may be an innate element of the promoter or a 
heterologous element inserted to enhance the level or tissue-specificity of a 
promoter. Promoters may be derived in their entirety from a native gene, or be 
20 composed of different elements derived fromr different promoters found in nature, 
or even comprise synthetic DNA segments. It is understood by those skilled in 
the art that different promoters may direct the expression of a gene in different 
tissues or cell types, or at different stages of development, or in response to 
different environmental conditions. Promoters which cause a gene to be 
25 expressed in most cell types at most times are commonly referred to as 

"constitutive promoters". New promoters of various types useftil in plant cells are 
constantly being discovered; numerous examples may be found in the compilation 
by Okamuro and Goldberg, (1989) Biochemistry of Plants 75: 1-82. It is further 
recognized that since in most cases the exact boundaries of regulatory sequences 
30 have not been completely defined, DNA fragments of different lengths may have 
identical promoter activity. 

The "translation leader sequence" refers to a DNA sequence located 
between the promoter sequence of a gene and the coding sequence. The 
translation leader sequence is present in the folly processed mRNA upstream of 
35 the translation start sequence. The translation leader sequence may affect 
processing of the primary transcript to mRNA, mRNA stability or translation 
efBciency. Examples of translation leader sequences have been described (Turner, 
R. and Foster, G.D. (1995) Molecular Biotechnology i:225). 
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The "3' non-coding sequences" refer to DNA sequences located 
downstream of a coding sequence and include polyadenylation recognition 
sequences and other sequences encoding regulatory signals capable of affecting 
mRNA processing or gene expression. The polyadenylation signal is usually 

5 characterized by affecting the addition of poly adenylic acid tracts to. the 3* end of 
the mRNA precursor. The use of different 3* non-coding sequences is exemplified 
by Ingelbrecht et al. ((1989) Plant Cell 7:671-680). 

"RNA transcript" refers to the product resulting from RNA polymerase- 
catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 

10 complementary copy of the DNA sequence, it is referred to as the primary 
transcript or it may be a RNA sequence derived from posttranscriptional 
processing of the primary transcript and is referred to as the mature RNA. 
"Messenger RNA (mRNA)" refers to the RNA that is without introns and that can 
be translated into protein by the cell. "cDNA" refers to a double-stranded DNA 

15 that is complementary to and derived from mRNA. "Sense" RNA refers to RNA 
transcript that includes the mRNA and so can be translated into protein, by the cell. 
' "Antisense RNA" refers to a RNA transcript that is complementary to all or part 
of a target primary transcript or mRNA and that blocks the expression of a target 
gene (U.S. Pat No. 5,107,065). The complementarity of an antisense RNA may 

20 be with any part of the specific gene transcript, i.e., at the 5* non-coding sequence, 
3' non-coding sequence, introns, or the coding sequence. "Functional RNA" 
refers to antisense RNA, ribozyme RNA, or other RNA that is not translated yet 
has an effect on cellular processes. 

The term "operably linked" refers to the association of nucleic acid 

25 sequences on a single nucleic acid fr^ment so that the function of one is affected 
by the other. For example, a promoter is operably linked with a coding sequence 
when it is capable of affecting the expression of that coding sequence (i.e., that the 
coding sequence is imder the transcriptional control of the promoter). Coding 
sequences can be operably linked to regulatory sequences in sense or antisense 

30 orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid 
fragment of the invention. Expression may also refer to translation of mRNA mto 
a polypeptide. "Antisense inhibition" refers to the production of antisense RNA 
35 transcripts capable of suppressing the expression of the target protein. 
"Overexpression" refers to the production of a gene product in transgenic 
organisms that exceeds levels of production in normal or non-transformed 
organisms. "Co-suppression" reYers to the production of sense RNA transcripts 
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capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Pat. No. 5,231,020). 

"Altered levels" refers to the production of gene product(s) in transgenic 
organisms in amounts or proportions that differ from that of normal or non- 

5 transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; 
i.e., one from which any pre- or propeptides present in the primary translation 
product have been removed. "Precursor" protein refers to the primary product of 
translation of mRNA; i.e., with pre- and propeptides still present. Pre- and 

10 propeptides may be but are not limited to intracellular localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is 
translated in conjunction with a protein and directs the protein to the chloroplast 
or other plastid types present in the cell in which the protein is made. 
"CMoroplast transit sequence" refers to a nucleotide sequence that encodes a 

15 chloroplast transit peptide. A "signal peptide" is an anaino acid sequence which is 
translated in conjunction wdth a protein and directs the protein to the secretory 
system (Chrispeels, JJ., (1991) Ann. Rev. Plant Phys. Plant Mol Biol 42:21-53). 
If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can 
further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum 

20 retention signal (supra) may be added. If the protein is to be directed to the 
nucleus, any signal peptide present should be removed and instead a nuclear 
localization signal mcluded (Raikhel (1992) Plant Phys. 100:1627-1632). 

"Transformation" refers to the transfer of a nucleic acid fragment into the 
genome of a host organism, resulting in genetically stable inheritance. Host 

25 organisms containing the transformed nucleic acid fragments are referred to as 
"transgenic" organisms. Examples of metliods of plant transformation include 
Agrobacterium-mediated transformation (De Blaere et al. (1987) Metk Enzymol 
143:277) and particle-accelerated or "gene gun" transformation technology (Klein 
et al. (1987) Nature (London) 527:70-73; U.S. Pat. No. 4,945,050). 

30 The term "herbicide-tolerant plant" as used herein is defined as a plant that 

survives and preferably grows normally at a usually effective dose of a herbicide. 
Herbicide tolerance in plants according to the present invention refers to 
detoxification mechanisms in a plant, although the herbicide bmding or target site 
is still sensitive. 

35 "Thiol donor" refers to a compound that contains tlie structure RSH (where 

R is not equal to H). Within the context of the present invention suitable thiol 
donors may include, but are not limited to. Glutathione and homoglutathione. 

"Electrophilic substrate"' refers to a compound that is amenable to 
conjugation with glutathione or homoglutathione via a sulfliydryl group. 

13 
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Electrophilic substrates include a wide variety of compounds including pesticides, 
anti-pathogenic compounds such as fungicides and profungicides, pheramones, 
and herbicides. Within the context of the present invention electrophilic 
substrates with herbicidal activity may include, but are not limited to, 

5 chlorimuronethyl, alachlor, and atrazine, l-chloro-2,4-dinitrobenzene (CDNB), 
ethacrynic acid, t-stilbene oxide, and l,2-epoxy-3-(p-nitrophenoxy)propane. 

Standard recombinant DNA and molecular cloning techniques used 
herein are well known in the art and are described more fully in Sambrook, J., 
Fritsch, E.F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold 

10 Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (heremafter 
"Maniatis"). 

"Thiol donor" refers to a compound that contains the structure RSH (where 
R is not equal to H). Within the context of the present invention suitable thiol 
donors may include, but are not limited to, Glutathione and homoglutathione. 

15 "Electrophilic substrate" refers to a compound that is amenable to 

conjugation with glutathione or homoglutathione via a sulfhydryl group, 
Electrophilic substrates include a wide variety of compounds includmg pesticides, 
anti-pathogenic compounds such as fungicides and profungicides, pheramones, 
and herbicides. Within the context of &e present mvention electrophilic 

20 substrates with herbicidal activity may include, but are not limited to, 

chlorimuronethyl, alachlor, and atrazine, l-chloro-2,4-dmitrobenzene (CDNB), 
ethacrynic acid, t-stilbene oxide, and l,2-epoxy-3-(p-nitrophenoxy)propane. 

The nucleic acid fragments of the instant invention may be used to isolate 
cDNAs and genes encoding homologous enzymes from the same or other plant 

25 species. Isolation of homologous genes using sequence-dependent protocols is 
well known m the art. Examples of sequence-dependent protocols include, but are 
not limited to, methods of nucleic acid hybridization, and methods of DNA and 
RNA amplification as exemplified by various lises of nucleic acid amplification 
technologies (e.g., polymerase chain reaction, ligase chain reaction). 

30 For example, genes encoding other GST enzymes, either as cDNAs or 

genomic DNAs, could be isolated directly by using all or a portion of the instant 
nucleic acid fragments as DNA hybridization probes to screen libraries from any 
desired plant usmg methodology well known to those skilled in the art. Specific 
oligonucleotide probes based upon the instant nucleic acid sequences can be 

35 designed and synthesized by methods known in the art (Maniatis). Moreover, the 
entire sequences can be used directly to synthesize DNA probes by methods 
known to the skilled artisan such as random primers DNA labeling, nick 
translation, or end-labeling techniques, or RNA probes using available in vitro 
transcription systems. In addition, specific primers can be designed and used to 

14 
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amplify a part of or full-length of the instant sequences. The resulting 
amplification products can be labeled directly during amplification reactions or 
labeled after amplification reactions, and used as probes to isolate full length 
cDNA or genomic fragments under conditions of appropriate stringency. 
5 In addition, two short segments of the instant nucleic acid fragments may 

be used in polymerase chain reaction protocols to amplify longer nucleic acid 
fragments encoding homologous genes from DNA or RN A. The polymerase 
chain reaction may also be performed on a library of cloned nucleic acid 
fragments wherein the sequence of one primer is derived from the instant nucleic 

10 acid fragments, and the sequence of the other primer takes advantage of the 
presence of the polyadenylic acid tracts to the 3' end of the mRNA precursor 
encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning yector. For example, the skilled artisan 
can follow the RACE protocol (Frohman et al., (1988) PNAS USA 55:8998) to 

15 generate cDNAs by using PGR to amplify copies of the region between a single 
point in the transcript and the 3' or 5' end. Primers oriented in the 3* and 5' 
directions can be designed from the instant sequences. Using commercially 
available 3' RACE or 5' RACE systems (BRL), specific 3' or 5' cDNA firagments 
can be isolated (Ohara et al., (1989) PNAS USA 86:5673 ; Loh et al., (1989) 

20 Science 243:217). Products generated by the 3' and 5* RACE procedures can be 
combined to generate full-length cDNAs (Frohman, M.A. and Martin, G.R., 
(19B9) Techniques 1:165). 

Availability of the instant nucleotide and deduced amino acid sequences 
facilitates immunological screening cDNA expression libraries. Synthetic 

25 peptides representing portions of the instant amino acid sequences may be 
synthesized. These peptides can be used to immunize animals to produce 
polyclonal or monoclonal antibodies with specificity for peptides or proteins 
comprising the amino acid sequences. These antibodies can be then be used to 
screen cDNA expression libraries to isolate full-length cDNA clones of interest 

30 (Lemer, R.A. (1984) Adv. Immunol. 36:1; Maniatis). 

The nucleic acid fragments of the instant inventioii may be used to create 
transgenic plants in which the disclosed GST enzymes are present at higher or 
lower levels than normal or in cell types or developmental stages in which they 
are not normally found: This would have the effect of altering the level of GST 

35 enzyme available as well as the herbicide tolerant-phenotype of the plant. 

Overexpression of the GST enzymes of the instant invention may be 
accomplished by first constructing chimeric genes in which the coding region are 
operably linked to promoters capable of directing expression of a gene in the 
desired tissues at the desired stage of development. For reasons of convenience, 
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the chimeric genes may comprise promoter sequences and translation leader 
sequences derived from the same genes. 3' Non-coding sequences encoding 
transcription termination signals must also be provided. The instant chimeric 
genes may also comprise one or more introns in order to facilitate gene 
5 expression. 

Any combination of any promoter and any terminator capable of inducing 
expression of a GST coding region may be used in the chimeric genetic sequence. 
Some suitable examples of promoters and terminators include those from nopaline 
synthase (nos), octopine synthase (ocs) and cauliflower mosaic virus (CaMV) 

10 genes. One type of efficient plant promoter that may be used is a high level plant 
promoter. Such promoters, in operable linkage with the genetic sequence for 
GST, should be capable of promoting expression of the GST such that the 
transformed plant is tolerant to an herbicide due to the presence of, or increased 
levels of, GST enzymatic activity. High level plant promoters that may be used in 

15 this invention include the promoter of the small subunit (ss) of the ribulose-1 ,5- 
bisphosphate carboxylase from example from soybean (Berry-Lov^e et al., 
J. Molecular andApp. Gen., 1:483-498 1982)), and the promoter of the 
chlorophyll a/b binding protem. These two promoters are known to be light- 
mduced in plant cells (See, for example, Genetic Engi neering of Plants, an 

20 Agricultural Perspective, A. Cashmore, Plenum, New York (1983), pages 29-38; 
Coruzzi, G. et al.. The Journal of Biological Chemistry, 258:1399 (1983), and 
Dunsmuii:, P. et al.. Journal of Molecular and Applied Genetics, 2:285 (1983)). 

Plasmid vectors comprising the instant cliimeric genes can then 
constructed. The choice of plasmid vector depends upon the method that will be 

25 used to transform host plants. The skilled artisan is well aware of the genetic 
elements that must be present on the plasmid vector in order to successfiiUy 
transform, select and propagate host cells containing the chimeric gene. The 
skilled artisan will also recognize that different independent transformation events 
will resuk in different levels and patterns of expression (Jones et al., (1985) 

30 EMBO J. ^:241 1-2418; De Almeida et al., (1989) Mol. Gen, Genetics 275:78-86), 
and thus that multiple events must be screened m order to obtain lines displaying 
the desired expression level and pattern. Such screening may be accomplished by 
Southern analysis of DNA blots (Southern, 7. MoL Biol 98, 503, (1975)). 
Northern analysis of mRNA expression (Kroczek, J. Chromatogr. Biomed Apply 

35 618 (1-2) (1993) 133-145), Westem analysis of protein expression, or phenotypic 
analysis. 

For some applications it will be useful to direct the instant GST enzymes 
to different cellular compartments or to facilitate enzyme secretion from a 
recombinant host cell. It is thus envisioned that the chimeric genes described 

16 
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above may be further supplemented by altering the coding sequences to encode 
enzymes with appropriate intracellular targeting sequences such as transit 
sequences (Keegstra, K., Cell 55:247-253 (1989)), signal sequences or sequences 
encoding endoplasmic reticulum localization (Chrispeels, JJ., Ann. Rev, Plant 

5 Phys. Plant Mol Biol 42:21-53 (1991)), or nuclear localization signals (Raikhel, 
N. Plant Phys J 00:\627A632 (1992)) added and/or with targeting sequences that 
are already present removed. While the references cited give examples of each of 
these, the list is not exhaustive and more targeting signals of utility may be 
discovered in the future that are useful in the invention. 

10 It may also be desirable to reduce or eliminate expression of the genes 

encoding the instant GST enzymes in plants for some applications. In order to 
accomplish this, chuneric genes designed for co-suppression of the instant GST 
enzymes can be constructed by linking the genes or gene fragments encoding the 
enzymes to plant promoter sequences. Alternatively, chimeric genes designed to 

15 express antisense RNA for all or part of the instant nucleic acid fragments can be 
constructed by linking the genes or gene fragment in reverse orientation to plant 
• promoter sequences. Either the co-suppression or antisense chimeric genes could 
be introduced into plants via transformation wherein expression of the 
corresponding endogenous genes are reduced or eliminated. 

20 Plants transformed with the present GST genes will have a variety of 

phenotypes corresponding to the various properties conveyed by the GST class of 
proteins. Glutathione conjugation catalyzed by GSTs is known to result in 
sequestration and detoxification of a number of herbicides and other xenobiotics 
(MaiTS et al.,>4nww. Rev. Plant Physiol, Plant Mol. Biol 47:127-58 (1996)) and 

25 thus will be expected to produce transgenic plants with this phenotype. Other 
GST proteins are known to be induced by various environmental stresses such as 
salt stress (Roxas, et al.. Stress tolerance in transgenic seedlings that ov erexpress 
plutathione S-transferase> Annual Meeting of the American Society of Plant 
Physiologists, (August 1997), abstract 1574, Final Program, Plant Biology and 

30 Supplement to Plant Physiology, 301), exposure to ozone (Sharma et al„ Plant 
Physiology, 105 (4) (1994) 1089-1096), and exposure to industrial pollutants such 
as sulfur dioxide (Navari-Izzo et al.. Plant Science 96 (1-2) (1994) 31-40). It is 
contemplated that transgenic plants, tolerant to a Avide variety of stresses, may be 
produced by the present method by expressing foreign GST genes in suitable plant 

35 hosts. 

The instant GST enzymes produced in heterologous host cells, particularly 
in the cells of microbial hosts, can be xised to prepare antibodies to the enzymes by 
methods well known to those skilled in the art. The antibodies are useful for 
detecting the enzymes in situ in cells or in vitro in cell extracts. Preferred 
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heterologous host cells for production of the instant GST enzymes, are microbial 
hosts. Microbial expression systems and expression vectors containing regulatory 
sequences that direct high level expression of foreign proteins are well known to 
those skilled in the art. Any of these could be used to construct chimeric genes for 

5 production of the instant GST enzymes. These chimeric genes could then be 
introduced into appropriate microorganisihs via transformation to provide high 
level expression of the enzymes. 

Vectors or cassettes useful for the transformation of suitable host cells are 
well known in the art. Typically the vector or cassette contains sequences 

10 directing transcription and translation of the relevant gene, a selectable marker, 
and sequences allowing autonomous replication or chromosomal integration. 
Suitable vectors comprise a region 5' of the gene which harbors transcriptional 
initiation controls and a region 3* of the DNA fragment which controls 
transcriptional termination. It is most preferred when both control regions are 

15 derived from genes homologous to the transformed host cell, although it is to be 
understood that such control regions need not be derived from the genes native to 
• the specific species chosen as a production host 

Initiation control regions or promoters, which are useful to drive 
expression of the genes encoding the GST enzymes in the desbed host cell are 

20 numerous and familiar to those skilled in the art. Virtually any promoter capable 
of driving these genes is suitable for the present invention including but not 
limited to CYCl, fflS3, GALl, GALIO, ADHl, PGK, PH05, GAPDH, ADCl, 
TRPl, URA3, LEU2, ENO, TP! (useful for expression in Saccharomyces); AOXl 
(useful for expression in Pichia); and lac, trp, XPl, ^r, T7, tac, and trc (useful 

25 for expression in E. coli). 

Termination control regions may also be derived from various genes native 
to the preferred hosts. Optionally, a termination site may be unnecessary, 
however, it is most preferred if included. 

An example of a vector for high level expression of the instant GST 

30 enzymes in a bacterial host is provided (Example 5). 

Additionally, the instant maize GST enzymes can be used as a targets to 
facilitate design and/or identification of inhibitors of the enzymes that may be 
useful as herbicides or herbicide synergists. This is desirable because the enzymes 
described herein catalyze the sulfhydryl conjugation of glutathione to compounds 

35 toxic to the plant. Conjugation can result in detoxification of these compounds. It 
is likely that inhibition of the detoxification process will result in inhibition of 
plant growth or plant death. Thus, the instant maize GST enzymes could be 
appropriate for new herbicide or herbicide synergist discovery and design. 
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All or a portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they 
are a part of, and as markers for traits linked to expression of the instant enzymes. 
Such information may be useful in plant breeding in order to develop lines with 

5 desired phenotypes or in the identification of mutants. 

For example, the instant nucleic acid fragments may be used as restriction 
fragment length polymorphism (RFLP) markers. Southern blots (Maniatis) of 
restriction-digested plant genomic DNA may be probed v^ith the nucleic acid 
fragments of the instant invention. The resulting banding patterns may then be 

10 subjected to genetic analyses using computer programs such as MapMaker 

(Lander et at, (1987) Genomics 7: 174-181) in order to construct a genetic map. 
In addition, the nucleic acid fragments of the instant invention may be used to 
probe Southern blots containing restriction endonuclease-treated genomic DNAs 
of a set of individuals representing parent and progeny of a defined genetic cross. 

15 Segregation of the DNA polymorphisms is noted and used to calculate the 
position of the instant nucleic acid sequence in the genetic map previously 
obtained using this population (Botstein et al., (1 980) Am. J, Hum. Genet. 
32:314-331). 

The production and use of plant gene-derived probes for use in genetic 

20 mapping arc described by Bematzky, R. and Tanksley, S.D. {Plant Mol. Biol 

Reporter 4(1):^1A\ (1986)). Numerous publications describe genetic mapping of 
specific cDN A clones using the methodology outlined above or variations thereof. 
For example, F2 intercross populations, backcross populations, randomly mated 
populations, near isogenic lines, and other sets of individuals may be used for 

25 mapping. Such methodologies are well known to those skilled in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may 
also be used for physical mapping (i.e., placement of sequences on physical maps; 
see Hoheisel et al.. In: Nonmammalian Genomic Analysis: A Practical Guide, 
Academic press 1996, pp. 319-346, and references cited therein). 

30 In another embodiment, nucleic acid probes derived from the instant 

nucleic acid sequences may be used in direct fluorescence in situ hybridization 
(FISH) mapping. AlAough current methods of FISH mapping favor use of large 
clones (several to several himdred KB), unprovements in sensitivity may allow 
performance of FISH mapping using shorter probes. 

35 A variety of nucleic acid amplification-based methods of genetic and 

physical mapping may be carried out using the instant nucleic acid sequences. 
Examples include allele-specific amplification, polymorphism of PCR-amplified 
fragments (CAPS), allele-specific ligation, nucleotide extension reactions, 
Radiation Hybrid Mapping and Happy Mapping. For these methods, the sequence 

19 



wo 00/18937 



PCTAJS98/20502 



of a nucleic acid fragment is used to design and produce primer pairs for use in the 
amplification reaction or in primer extension reactions. The design of such 
primers is well known to those skilled in the art. In methods employing PCR- 
based genetic mapping, it may be necessary to identify DNA sequence differences 
5 between the parents of the mapping cross in the region corresponding to the 
instant nucleic acid sequence. This, however, this is generally not necessary for 
mapping methods. Such information may be useftil in plant breeding in order to 
develop lines with desired starch phenotypes. 

EXAMPLES 

10 The present invention is further defined in the following Examples, in 

which all parts and percentages are by weight and degrees are Celsius, unless 
otherwise stated It should be xmderstood that these Examples, while indicating 
preferred embodiments of the invention, are given by way of illustration only. 
From the above discussion and these Examples, one skilled in the art can ascertain 

15 the essential characteristics of this invention, and without departing firom the spirit 
and scope thereof, can make various changes and modifications of the invention to 
adapt it to various usages and conditions. 
OFNERAL METHODS 

Standard recombinant DNA and molecular cloning techniques used m the 

20 Examples are well known in the art and are described by Sambrook, J. , Fritsch, 
E.F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring 
Harbor Laboratory Press: Cold Spring Harbor, (1989) (Maniatis) and by T. J. 
Silhavy, M. L. Bennan, and L. W. Enquist, Experiments with Gene Fusions, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1984) and by Ausubel, F. 

25 M. et al.. Current Protocols in Molecular Biology, pub. by Greene Publishing 
Assoc. and Wiley-Interscience (1987). 

EXAMPLE 1 

Composition of cDNA Libraries: Isolation and Seq uencing of cDNA Clones 
cDNA libraries representing mRNAs from various maize tissues were 
30 prepared. The characteristics of the libraries are described in Table 1 . 
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TABLE I 
cDNA Libraries From Com Tissues 



Library 


GST 
Class 


Clone 


Tissue 


bmsl 


I 


bmsl.pkO023.g8 


Maize BMS cell culture 1 day atter suocuinire 


csl 


I 


csl.pk0010.c5 


Maize leaf, sheath 5 wk plant Stratogene #837201 


cebl 


1 


cebl.pk00I7.a5 


Maize embryo 


cc71se 


III 


cc71se-a.pk000I.g2 


Maize class 11 callus tissue, somatic embryo, 
formed, highly transformable 


cc71se 


III 


cc71se-b.pk0014.b8 


Maize class II callus tissue, somatic embryo 
formed, highly transfonnable 


cebS 


III 


ceb5.pk0051.fB 


Amplified maize embryo 30 day 


crln 


III 


crln.pk0003.bl 


Maize root from 7 day seedlings grown in light 
nonnalized 


crln 


in 


crln.pk0014.g8 


Maize root from 7 day seedlings grown in light 
normalized 


m 


II 


m.l5.5.d06.sk20 


Maize 15 day embryo library 


crln 


11 


crln.pk0040.el2 


Maize root from 7 day seedlings grown in light 
normalized 


cebS 


III 


cebS.pk0049.all 


Amplified maize embryo 30 day 


csl 


III 


csl.pk00S9.e2 


Maize leaf, sheath S wk plant Stratogene #837201 



cDNA libraries were prepared in Uni-ZAP™ XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems, La JoUa, CA). The 
Uni-ZAP™ XR libraries were converted into plasmid libraries according to the 
5 protocol provided by Stratagene. Upon conversion, cDNA inserts were contained 
in the plasmid vector pBluescript. cDNA mserts from randomly picked bacterial 
colonies containing recombinant pBluescript plasmids were amplified via 
polymerase chain reaction using primers specific for vector sequences flanking the 
inserted cDNA sequences. Amplified insert DNAs were sequenced in dye-primer 
10 sequencing reactions to generate partial cDNA sequences (expressed sequence 
tags or "ESTs"; see Adams, M. D. et al., (1991) Science 252:1651). The resulting 
ESTs were analyzed using a Perkin Ehner Model 377 fluorescent sequencer. 

EXAMPLE 2 
Identification and Characterization of cDNA Clones 
15 cDNAs encoding maize GST enzymes were identified by conducting 

BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et al., (1993) J. Mol 
Biol. 215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for 
similarity to sequences contained in the BLAST "nr" database (comprising all 
non-redundant GenBank CDS translations, sequences derived fi'om the 
20 3-dimensional structure Brookhaven Protein Data Bank, SWISS-PROT protein 
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sequence database, EMBL, and DDBJ databases). The cDNA sequences obtained 
in Example 1 were analyzed for similarity to all publicly available DNA 
sequences contained in the "nr" database using the BLASTN algorithm provided 
by the National Center for Biotechnology Information (NCBI). The DNA 

5 sequences v/ere translated in all readmg frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the 
BLASTX algorithm (Gish, W. and States, D. J. (1993) Nature Genetics 
5:266-272) provided by the NCBI. For convenience, the P-value (probability) of 
observing a match of a cDN A sequence to a sequence contained in the searched 

10 databases merely by chance as calculated by BLAST are reported herein as 
"pLog" values, w^hich represent the negative of the logarithm of the reported 
P-value. Accordingly, the greater the pLog value, the greater the likelihood that 
the cDNA sequence and the BLAST "hit" represent homologous protems. 
All comparisons were done using the BLASTNnr algorithm with the 

15 exception of crln.pk0040,el2 where BLASTXnr was used. The results of the 
BLAST comparison is given in Table 2 and summarizes the clones and the 
• sequences to which they have the most similarity. Each cDNA identified encodes 
at least a portion of either a GST class I, II, or HI. All isolated clones contain a 
full length open reading frame (ORF) with the exception of cc71se-a.pk0001 .g2 

20 which is only a partial clone. Example 5 describes the sequencing strategy for the 
above described clones. 

TABLE 2 
BLAST Results For Clones 

SEQ ID NO. 

GST Blast pLog 

Clone Class Similarity Identified Base Peptide Algoritfam Score 

bmsl.pk0023.g8 I X79515|ZMGST27 Z.mays 1 2 Nnr 122.086 

GST-27 mRNA for glutathione- 
s-transferase 

csl.pk0010.c5 I D17673IATHERD13 3 4 Nnr 8.16 

Arabidopsis thaliana mRNA for 
glutathione S-transferase 

cebl.pk0017.a5 I X78203|HMGST H.muticus 5 6 Nnr 21.51 

mRNA for glutathione S- 
transferase 

cc71se-a.pk0001.g2 III {AF0043 58) glutathione S- 7 8 Nnr 16.48 

transferase TSM (Aegilops 
squaiTOsa) 

cc71se-b.pk0014.b8 HI D10861|RICORFC Rice mRNA 9 10 Nnr 14.96 

for a protein related to chilling 
tolerance 

ceb5.pk0051.fB III D10861|RICOJIFC Rice mRNA 11 12 Nnr 40.44 

for a protein related to chilling 
tolerance 
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Clone 
crln.pkp003.bl 

crln.pk0014.g8 

m.l5.5.d06.sk20 

crln.pk0040.el2 
cebS.pk0049.all 

csLpk0059.e2 

EXAMPLES 
Expression of Chimeric Genes Encoding Maize 
5 GST Enzymes in Maize Cells (Monocotyledon) 

A chimeric gene comprising a cDNA encoding a maize GST enzyme in 
sense orientation can be constructed by polymerase chain reaction (PGR) of the 
cDNA clone using appropriate oligonucleotide primers. Cloning sites (Ncol or 
Smal) can be incorporated into tiie oligonucleotides to provide proper orientation 

10 of the DNA fragment when inserted into the digested vector pMLlOS as described 
below. Amplification is then performed in a 100 jiL volume in a standard PGR 
mix consisting of 0.4 mM of each oligonucleotide and 0.3 pM of target DNA in 
10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 200 mM dGTP, 200 mM 
dATP, 200 mM dTTP, 200 mM dCTP and 0.025 unit DNA polymerase. 

15 Reactions are carried out in a Perkin-Ehner Cetus Thermocycler^** for 30 cycles 
comprismg 1 min at 95 °C, 2 min at 55 ^C and 3 min at 72 °C, with a final 7 min 
extension at 72 °C after the last cycle. The ampUfied DNA is then digested with 
restriction enzymes Ncol and Smal and fractionated on a 0.7% low melting point 
agarose gel in 40 mM Tris-acetate, pH 8.5, 1 mM EDTA. The appropriate band 

20 can be excised from the gel, melted at 68 °C and combined with a 4.9 kb 
Ncol-Smal fiagment of the plasmid pML103. Plasmid pML103 has been 
deposited under the terms of the Budapest Treaty at ATCC (American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, MD 20852), and bears 

23 



SEQ ID NO. 

GST Blast pLog 

Class Similarity Identified Base Peptide Algorithm Score 

24,70 

III U80615IEGU80615 Eucalyptus 13 14 Nnr 
globulus auxin-induced protein 
(EgPar) mRNA, complete cds 

III M16901IM2EGSTIB Maize 15 16 Nnr 

glutathione S-transferase (GST- 
I) mRNA, complete cds 

II |M97702|DROGLUSTD 17 18 Nnr 

Drosophila melanogaster 
glutathione S-transferase gene 

II 167970 (L05915)(GST1) gene 19 20 Xnr 
produa (Dianthus caryophyllus) 

III |Y12862|ZYMY12862Zea 21 22 Nnr 
Maize riiRNA for glutathione 
S-transferase 

III D10861|RICORFCRice 24 25 Nnr 

mRNA for a protein related 
to chilling tolerance 



5.85 

3.63 

42.03 
0.0 

41.03 
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accession number ATCC 97366. The DNA segment from pML103 contains a 
1 .05 kb Sall-Ncol promoter fragment of the maize 27 kD zein gene and a 0.96 kb 
Smal-Sall fragment from the 3' end of the maize 10 kD zein gene m the vector 
pGem9Zf(+) (Promega Corp 7113 Benhart Dr, Raleigh, NC). Vector and insert 

5 DNA can be ligated at 1 5 °C overnight, essentially as described (Maniatis). The 
ligated DNA may then be used to transform E. coli XLl-Blue (Epicurian Coli 
XL-1; Stratagene). Bacterial transformants can be screened by restriction enzyme 
digestion of plasmid DNA and limited nucleotide sequence analysis using the 
dideoxy chain termination method DNA Sequencing Kit; U. S. Biochemical). The 

10 resulting plasmid construct would comprise a chimeric gene encoding, in the 5' to 
3' direction, the maize 27 kD zein promoter, a cDNA fragment encoding a plant 
GST enzyme, and the 10 kD zein 3' region. The chimeric gene described above 
can then be introduced into com cells by the following procedure. Immature com 
embryos can be dissected from developing caryopses draived from crosses of the 

15 mbred com lines H99 and LHl 32 (Indiana Agric. Exp. Station, Indiana, USA). 
The embryos are isolated 10 to 1 1 days after pollination when they are 1 .0 to 
1 .5 mm long. The embryos are then placed with the axis-side facing down and in 
contact with agarose-solidified N6 medium (Chu et al., (1975) Set Sin. Peking 
18:659-668). The embryos are kept in the dark at 27 *C. Friable embryogenic 

20 callus consisting of undifferentiated masses of cells with somatic proembryoids 
and embryoids borne on suspensor structures proliferates from the scutellum of 
these immature embryos. The embryogenic callus isolated from the primary 
explant can be cultured on N6 medium and sub-cultured on this medium every 2 
to 3 weeks. The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, v 

25 Frankfurt, Gennany) may be used in transformation experiments in order to 

provide for a selectable marker. This plasmid contains the Pat gene (see European 
Patent Publication 0 242 236) which encodes phosphinothricin acetyl transferase 
(PAT). The enzyme PAT confers resistance to herbicidal glutamine synthetase 
inhibitors such as phosphinothricin. The pat gene in p35S/Ac is under the control 

30 of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. (1985) Nature 
313:810-812) and the 3M region of the nopaline synthase gene from the T-DNA 
of the Ti plasmid of Agrobacterium tumefaciens. The particle bombardment 
method (Klein et al., (1987) Nature 327:70-73) may be used to transfer genes to 
the callus ctilture cells. According to this method, gold particles ((1 ym in 

35 diameter) are coated with DNA using the following technique. Ten ug of plasmid 
DNAs are added to 50 uL of a suspension of gold particles (60 mg per mL). 
Calcium chloride (50 ^L of a 2.5 M solution) and spermidine free base (20 of 
a 1.0 M solution) are added to tKe particles. The suspension is vortexed during the 
addition of these solutions. After 10 minutes, the tubes are briefly centrifuged 
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(5 sec at 15,000 ipm) and the supernatant removed. The particles are resuspended 
in 200 \iL of absolute ethanol, centrifuged again and the supernatant removed. 
The ethanol rinse is performed again and the particles resuspended in a final 
volume of 30 uL of ethanol. An aliquot (5 ^L) of the DNA-coated gold particles 

5 can be placed in the center of a flying disc (Bio-Rad Labs, 861 Ridgeview Dr, 
Medina, OH). The particles are then accelerated into the com tissue with a 
PDS-lOOO/He (Bio-Rad Labs, 861 Ridgeview Dr, Medina, OH), using a helium 
pressure of 1000 psi, a gap distance of 0.5 cm and a flying distance of 1.0 cm. For 
bombardment, the embryogenic tissue is placed on filter paper over agarose- 

10 solidified N6 medium. The tissue is arranged as a thin lawn and covered a 

circular area of about 5 cm in diameter. The petri dish containing the tissue can be 
placed in the chamber of the PDS-lOOO/He approximately 8 cm &om the stopping 
screen. The air in the chamber is then evacuated to a vacuum of 28 inches of Hg. 
The macrocairier is accelerated with a helium shock wave using a nqjture 

15 membrane that bursts when the He pressure in the shock tube reaches 1 000 psi. 
Seven days after bombardment the tissue can be transferred to N6 medium that 
contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue 
continues to grow slowly on this medium. After an additional 2 weeks the tissue 
can be transferred to fi«sh N6 medium containing gluphosinate. After 6 weeks, 

20 areas of about 1 cm in diameter of actively growing callus can be identified on 
some of the plates containing the glufosinate-supplemented medium. These calli 
may continue to grow when sub-cultured on the selective medium. Plants can be 
regenerated firom the transgenic callus by first transferring clusters of tissue to N6 
medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the tissue 

25 can be transferred to regeneration medium (Fromm et al., (1990) Bio/Technology 
8:833-839). 

EXAMPLE 4 

Expression of Chimeric Genes in Tobacco Cells r Picotvledon) 
Cloning sites (Xbal or Smal) can be incorporated into the oligonucleotides 

30 to provide proper orientation of the DNA Augment when inserted into the digested 
vector pBI121 (Clonetech Inc., 6500 Donlon Rd, Somis, CA) or other appropriate 
transformation vector. Amplification could be performed as described above and 
the amplified DNA would then be digested with restriction enzymes Xbal and 
Smal and fractionated on a 0.7% low melting point agarose gel in 40 mM Tris- 

35 acetate, pH 8.5, 1 mM EDTA. The appropriate band can be excised from the gel, 
melted at 68 ''C and combined with a 13 kb Xbal-Smal fragment of tiie plasmid 
pBI121 and handled as in Example 3. The resulting plasmid construct would 
comprise a chimeric gene encoding, in the 5' to 3' direction, right border region, 
the nos promoter linked to the NPT II gene and a nos terminator region followed 

25 
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by a cauliflower mosaic virus 35S promoter linked lo a cDNA tVaiimeni encoding 
a plant GST enzyme and the nos terminator 3' region flanked by the left border 
region. The resulting plasmid could be mobilized into the .h^rohacterium strain 
LBA4404/pAL4404 (Hoekema et al. Nature 303:179-1 80. (1983) using tri- 

5 parental matings (Ruvkin and Ausubel. Nature 289:85-88. (1981 )). The resulting 
Agrohacterium strains could be then cocultivated with protoplasts (van den Elzen 
et al. Plant MoL BioL 5: 149-154 (1985)) or leaf disks (Horsch et al. Science 
227:1229-1231, (1985)) ofNicotiana tabacum cv Wisconsin 38 and kanamycin- 
resistant transformants would be selected. Kanamycin-resistant transformed 

10 tobacco plants would be regenerated. 

EXAMPLE 5 
Expression Of Chimeric Genes In Microbial Cells 
And Purification Of Gene Product 
Example 5 illustrates the expression of isolated full length genes encoding 

15 either class I, II or III GST proteins in £. coli. 

All clones listed in Table 2 were selected on the basis of homology to 
known GSTs using the BLAST algorithm as described in Example 2. Plasmid 
DNA was purified using QIAFilter cartridges (Qiagen. Inc., 9600 De Soto Ave, 
Chatsworth, CA) according to the manufacturer's instructions. Sequence was 

20 generated on an ABI Automatic sequencer using dye terminator technology (U.S. 
5366860; EP 272007) using a combination of vector and insert-specific primers. 
Sequence editing was performed in either DNAStar (DNA, Star Inc.) or the 
Wisconsin GCG program (Wisconsin Package Version 9.0, Genetics Computer 
Group (GCG), Madison, WI). All sequences represent coverage at least two times 

25 in both directions. 

cDNA from the clones bmsl.pk0023.g8, csl.pk0010.c5, cebl.pk0017.a5, 
m.l5.5.dO6.sk20, ceb5.pk0049.all, ceb5.pk0051.f8, and csl.pk0059.e2, encoding 
the instant maize GST enzymes were inserted into the ligation independent 
cloning (LIC) pET30 vector (Novagen, Inc., 597 Science Dr, Madison, WI) under 

30 the control of the T7 promoter, according to the manufacturer's instructions (see 
Novagen publications "LIC Vector Kits", publication number TBI 63 and U.S. 
4952496). The vector was then used to transform BL21(DE3) competent £. coli 
hosts. Primers with a specific 3' extension designed for ligation independent 
cloning were designed to amplify the GST gene (Maniatis). Amplification 

35 products were gel-purified and annealed mto the LIC vector after treatment with 
T4 DNA polymerase (Novagen). Insert-containing vectors were then used to 
transform NovaBlue competent E. coli cells and transformants were screened for 
the presence of viable inserts. Clones in the correct orientation with respect to the 
T7 promoter were u-ansformed into BL21(DE3) competent cells (Novagen) and 

26 



wo 00/18937 



PCTAJS98/20502 



selected on LB agar plates containing 50 i-ig/mL kanamycin. Colonies arising 
from this transformation were grown overnight at 37 °C in Lauria Broth to OD 
600 = 0.6 and induced with I mM IPTG and allowed to grow for an additional 
two hours. The culture was har\ested. resuspended in binding buffer, lysed with a 

5 French press and cleared by centrifugation. 

Expressed protein was purified using the HIS binding kit (Novagen) 
according to the manufacturer's instructions. Purified protein was examined on 
15-20% SDS Phast Gels (Bio-Rad Laboratories, 861 Ridgeview Dr. Medina, OH) 
and quantitated spectrophotometrically using BSA as a standard. Protein data is 

10 tabulated below in Table 3. 

TABLE 3 
Protein Expression Data 



CLONE OD. 280 

binsl.pk0023.g8 0.57 

csl.pk0010.c5 0.53 

cebl.pk0017.a5 0.50 

m.l5.5.d06.sk20 0.39 

ceb5.pk0049.all 2.06 

ceb5.pk0051.ft 1-30 

csl.pk0059.e2 1-45 



EXAMPLE 6 

Screening Of Expressed GST Enzymes For Sub.s trate Metabolism 
15 The GST enzymes, expressed and purified as described in Example 5 were 

screened for their ability to metabolize a variety of substrates. Substrates tested 
included the three herbicide electrophilic substrates chlorimuron ethyl, alachlor, 
and Atrazine, and four model electrophilic substrates, l-chloro-2, 4-dinitro- 
benzene (CDNB), ethacrynic acid, t-stilbene oxide, and l,2-epoxy-3-(p-nitro- 
20 phenoxy) propane. The enzymes were purified as described in Example 5 and 
used in the following assay. 

For each enzyme, the conjugation reaction with each electrophilic 
substrate was performed by incubating 0.3 to 30 ^g enzyme in 0.1 M MOPS 
(pH 7.0) containing 0.4 mM of the electrophilic substrate. The reaction was 
25 inititated by the addition of glutathione to a final concentration of 4 mM. After 5 
to 30 min, the reaction was terminated by the addition of 45 \iL acetonitrile, 
microfuged for 10 min to remove precipitated protein, and then the supematent 
was removed and added to 65 ^l of water. This sample was chromatographed on 
a Zorbax C8 reverse phase HPLC column (3 \im particle size, 6.2 mm x 8 cm) 
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using a combination of linear gradients ( tlow = 1 .5 mL/min) of 1% H3PO4 in 
water (solvent A) and 1% H3PO4 in acetonitrile. The gradient started with 5% 
solvent B. progressing from 5% to 75% solvent B between 1 and 10 min. and 
from 75% to 95% solvent B between 10 and 12 min. Control reactions without 

5 enzyme were performed to correct for uncatalyzed reaction. Quantitation ot 
metabolites were based on an assumption that the extinction coefficient of the 
conjugate was identical to that of the electrophilic substrate. 

Table 4 shows the activity of each enzyme measured in nmol-min-^'mg"' 
with the seven different substrates. Activities are related to the activities of the 

10 known and previously isolated and purified GST enzymes, BZ-II (Marrs et al., 
Nature 375:397-400 (1995)), plN2-l (Hershey et al.. Plant Molecular Biology 
17:679-690, (1991)), GST-I, GST-III, and GST-IV, collectively described in Shah 
et al., Plant Mol Biol 6, 203-21 1(1986); Jepson et al.. Plant Mol Biol 
26:1855-1866, (1994); Moore et al.. Nucleic Acids Res Unin-lliS (1986); and 

15 Holt etal.,P/aAira 196:295-302, (1995). 



TABLE 4 



















l,2-epoxy-3- 






Chlor- 












(p-nitro- 




GST 


Imuron- 








Ethacrynic 


t-Stilbene 


phenoxy) 


GST Name 


Class 


Ethyl 


Alachlor 


Atrazine 


CDNB 


Acid 


Oxide 


propane 


csl.pk0059.e2 


III 


0.1 


8 


0.02 


1348 


20 


125 


43 


cebS.pk0049.al 1 


III 


0.4 


18 


0.01 


3939 


102 


0.01 


30 


cebS.pk00Sl.f8 


III 


1.9 


27 


0.08 


2136 


117 


0.02 


14 


BZ-II 


in 


0.2 


0 


0.00 


IS 


23 


0.05 


0 


cebI.pk0017.aS 


I 


0.1 


0 


0.00 


15 


5 


0.00 


0 


csl.pk0010.cS 


I 


0.1 


0 


0.00 


30 


9 


0.00 


0 


binsl.pk0023.g8 


I 


0.2 


0 


0.00 


15 


13 


0.00 


0 


GST-IV 


I 


0.3 


1 


0.00 


15 


13 


0.00 


0 


GST-I 


I 


0.4 


77 


0.60 


46485 


32 


0.98 


92 


GST-III 


I 


0.3 


3 


0.05 


1803 


1 


0.31 


28 


m.lS.S.d06.sk20 


II 


0.1 


0 


0.00 


45 


17 


0.00 


1 


pIN2-l 


IV 


0 


0 




15 
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What is claimed is: 

1 . An isolated nucleic acid fragment encoding a maize GST enzyme 
selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding all or a substantial 
5 portion of the amino acid sequence selected from the group 

consisting of SEQ ID N0:2. SEQ ID N0:4. SEQ ID N0:6. SEQ 
ID N0:8. SEQ ID NO: 10. SEQ ID NO: 12. SEQ ID NO: 14. SEQ 
ID NO: 1 6. SEQ ID NO: 18. SEQ ID NO:20. SEQ ID NO:22, and 
SEQ ID NO:24; 

10 (b) an isolated nucleic acid fragment that is substantially similar to an 

isolated nucleic acid fi-agment encoding all or a substantial 
portion of the amino acid sequence selected from the group 
consisting of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ 
ID N0:8, SEQ ID NO:10, SEQ ID N0:12, SEQ ID N0:14, SEQ 

15 ID NO: 1 6, SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, and 

SEQ IDNO:24;and 
(c) an isolated nucleic acid fragment that is complementary to (a) or 

(b). 

2. The isolated nucleic acid fragment of Claim 1 selected from the group 
20 consisting of SEQ ID NO: 1 , SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ 

ID N0:9, SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ 
ID N0:19, SEQ ID N0:21, and SEQ ID NO:23. 

3 . A polypeptide encoded by the isolated nucleic acid fragment of 

Claim 1. 

25 4. The polypeptide of Claim 3 selected from the group consisting of SEQ 

ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO: 10, SEQ ID 
N0:12, SEQ ID N0:14, SEQ ID N0:16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID 
N0:22,andSEQIDN0:24. 

5. A chimeric gene comprising the isolated nucleic acid firagment of 
30 Claim 1 operably linked to suitable regulatory sequences. 

6. A transformed host cell comprising a host cell and the chimeric gene 
of Claim 5. 

7. The o-ansformed host cell of Claim 6 wherein the host cell is a plant 

cell. 

35 8. The transformed host cell of Claim 6 wherein the host cell is £ coli. 

9, A method of altering the level of expression of a maize GST enzyme 
in a host cell comprising: 

(a) transforming a host cell with the chimeric gene of Claim 5 and: 
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(b) growinii the transformed host cell produced in step (a) under 
conditions that are suitable for expression of the chimeric gene 

resultin in production of altered levels of a maize GST enzyme in the transformed 
host cell relative to expression levels of an untransformed host cell. 
5 10. A method of obtaining a nucleic acid fragment encoding all or a 

substantial portion of the amino acid sequence encoding a maize GST enzyme 
comprising: 

(a) probing a cDNA or genomic library with the nucleic acid 
fragment of Claim 1: 

10 (b) identifying a DNA clone that hybridizes with the nucleic acid 

fragment of Claim 1 ; and 

(c) sequencing the cDNA or genomic fragment that comprises the 
clone identified in step (b), 

wherein the sequenced cDNA or genomic fragment encodes ail or substantially all 
15 of the amino acid sequence encoding a maize GST enzyme 

11. A method of obtaining a nucleic acid fragment encoding all or a 
substantial portion of the amino acid sequence encoding a maize GST enzyme 
comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion 
20 of the sequence selected from the group consisting of SEQ ID 

N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID 
N0:9, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO: 17, SEQ ID NO: 19, SEQ ID N0:21, and SEQ ID NO:23; 

(b) amplifying a cDNA insert present in a cloning vector using the 
25 oligonucleotide primer of step (a) and a primer representing 

sequences of the cloning vector, 
wherein the amplified cDNA insert encodes a portion of an amino acid sequence 
encoding a maize GST enzyme. 

1 2. The product of the method of Claim 1 0 or 1 1 . 

30 1 3 . A method for identifying a chemical compound that inhibits the 

activity of a maize GST enzyme encoded by the nucleic acid fragment of Claim 1, 
the method comprising the steps of: 

(a) transforming a host cell with a chimeric gene comprising the 
nucleic acid fragment of Claim 1 encoding a maize GST enzyme, 

35 the chimeric gene operably linked to at least one suitable 

regulatory sequence; 

(b) growing the transformed host cell of step (a) under conditions 
suitable for expression of the chimeric gene resulting in 
production of the GST enzyme; 
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(c) optionally purifying the GST enzyme expressed by the 
transformed host cell: 

(d) contacting the GST enzyme with a chemical compound of 
interest: and 

5 (e) identifying the chemical compound of interest that reduces the 

activity of the maize GST enzyme relative to the activity of the 
maize GST enzyme in the absence of the chemical compound of 
interest. 

14. The method of Claim 13 wherein step (d) is carried out in the 
10 presence of at least one electrophilic substrate and at least one thiol donor. 

1 5. The method of Claim 1 3 wherein said nucleic acid fragment is 
selected from the group consisting of SEQ ID N0:1, SEQ ID N0:3, SEQ ID 
N0:5, SEQ ID N0:9, SEQ ID NO: II , SEQ ID NO: 1 3, SEQ ID NO: 1 5, SEQ ID 
N0:17, SEQ ID N0:19, SEQ ID N0:21, and SEQ ID NO:23, and wherein the 

15 maize GST enzyme is selected from the group consisting of SEQ ID N0:2, SEQ 
ID N0:4, SEQ ID N0:6, SEQ ID NO: 10, SEQ ID N0:12, SEQ ID N0:14, SEQ 
ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, and SEQ ID NO:24. 

16. A method for identifying a chemical compound that inhibits the 
activity of a maize GST enzyme encoded by the nucleic acid fragment of Claim 1, 

20 the method comprising the steps of: 

(a) transforming a host cell with a chimeric gene comprising a 

nucleic acid fragment of Claim 1 encoding a maize GST enzyme, 
the chimeric gene operably linked to at least one regulatory 
sequence; 

25 (b) growing the transformed host cell of step (a) under conditions 

suitable for expression of the chimeric gene resulting in 
production of the GST enzyme; 
(c) contacting the transformed host cell of step (b) with an inhibitor 
candidate; and 

30 (d) comparing the phenotype of the transformed host cell contacted 

with an inhibitor candidate with the phenotype of the transformed 
host cell that was not contacted with an inhibitor candidate to 
identify the chemical compound that inhibits the activity of the 
soybean GST enzyme. 
35 1 7. The method of Claim 16 wherein the nucleic acid fragment is selected 

from the group consisting of SEQ ID NO: 1, SEQ ID N0:3. SEQ ID N0:5, SEQ 
ID N0:9, SEQ ID N0:1 1, SEQ ID NO: 13, SEQ ID NO: 15. SEQ ID NO: 17. SEQ 
ID N0:19, SEQ ID N0:21, and SEQ ID NO:23, and wherein the maize GST 
enzyme is selected from the group consisting of SEQ ID N0:2, SEQ ID N0:4, 
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SEQ ID N0:6, SEQ ID NO: 1 0. SEQ ID NO: 1 2, SEQ ID NO: 14, SEQ ID NO: 1 6, 
SEQ ID NO: 1 8, SEQ ID NO:20. SEQ ID NO:22, and SEQ ID NO:24. 

1 8. A method for identifying a substrate for a GST enzyme, the GST 
enzyme encoded by the isolated nucleic acid fragment of Claim 1. the method 

5 comprising the steps of: 

(a) transforming a host cell with a chimeric gene comprising an 
isolated nucleic acid fragment of Claim I encoding a maize GST 
enzyme, the chimeric gene operably linked to at least one suitable 
regulatory sequence: 

10 (b) growing the transformed host cell of step (a) under conditions 

suitable for expression of the chimeric gene resulting in 
production of the soybean GST enzyme; 
(c) optionally purifying the GST enzyme expressed by the 
transformed host cell; 

15 (d) contacting the soybean GST enzyme with a substrate candidate; 

and 

(e) comparing the activity of soybean GST enzyme that has been 
contacted with the substrate candidate with soybean GST enzyme 
that has not been contacted with the substrate candidate, 
20 selecting substrate candidates that increase the activity of the soybean GST 
enzyme relative to the activity of the soybean GST enzyme in the absence of a 
substrate candidate. 

1 9. The method of Claim 1 8 wherein step (d) is carried out in the 
presence of at least one thiol donor. 

25 20: The method of Claim 1 8 wherein the nucleic acid fragment is selected 

from the group consisting of SEQ ID NO: 1 , SEQ ID N0:3, SEQ ID N0:5, SEQ 
ID N0:9, SEQ ID N0:1 1, SEQ ID N0:13, SEQ ID N0:15, SEQ ID N0:17, SEQ 
ID N0:19, SEQ ID N0:21, and SEQ ID NO:23, and wherein the maize GST 
enzyme is selected from the group consisting of SEQ ID N0:2, SEQ ID N0:4, 

30 SEQ ID N0:6, SEQ ID NO:10, SEQ ID N0:12, SEQ ID N0:14, SEQ ID N0:16, 
SEQ ID NO: 1 8. SEQ ID NO:20, SEQ ID NO:22, and SEQ ID N0:24. 

21. A method for identifying a substrate for a maize GST enzyme, the 
method comprising the steps of: 

(a) transforming a host cell v^ath a chimeric gene comprising a 
35 nucleic acid fragment of Claim 1 , the chimeric gene operably 

linked to at least one suitable regulatory sequence; 

(b) growing the transformed host cell of step (a) under conditions 
suitable for expression of the chimeric gene resulting in 
production of the maize GST enzyme; 
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(c) contacting the transformed host cell of step (b) with a maize GST 
substrate candidate: and 

(d) comparing the phenotype of the transformed host cell contacted 
with the substrate candidate with the phenotype of the 

5 transformed host cell that was not contacted with the substrate 

candidate to identify a soybean GST enzyme substrate. 
22. The method of Claim 2 1 wherein the nucleic acid fragment is selected 
from the group consisting of SEQ ID NO: K SEQ ID N0:3. SEQ ID N0:5, SEQ 
ID N0:9, SEQ ID NO: 1 K SEQ ID NO: 1 3, SEQ ID NO: 1 5, SEQ ID NO: 1 7, SEQ 
10 ID NO: 19, SEQ ID N0:21, and SEQ ID NO:23, and wherein the maize GST 
enzyme is selected from the group consisting of SEQ ID N0:2, SEQ ID N0:4, . 
SEQ ID N0:6, SEQ ID NO:10, SEQ ID N0:12, SEQ ID NO:14, SEQ ID N0:16, 
SEQ ID N0:18, SEQ ID NO:20, SEQ ID N0:22, and SEQ ID N0:24. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) ADDRESSEE: E.I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 19898 

(G) TELEPHONE: 302-892-7229 

(H) TELEFAX: 302-773-0164 

(I) TELEX: 6717325 

(ii) TITLE OF INVENTION: PLANT GLUTAT HI ONE- S -TRANSFERASE 

ENZYMES 

(iii) NUMBER OF SEQUENCES: 24 

liv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MICROSOFT WINDOWS 95 

(D) SOFTWARE: MICROSOFT WORD VERSION 7. OA 

(V) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) ATTORNEY /AGENT INFORMATION: 

(A) NAME: KAREN K. KINB 

(B) REGISTRATION NUMBER: 34,850 

(C) REFERENCE/DOCKET NUMBER: CL-1128 
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(2) IMFORNIATIOM FOR oEQ ID N*0:i: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8«J4 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNES3: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: -DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: BMSl . PK0023 . G8 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CGGCACGAGC AATGGCGCCG CCGATGAAGG TGTACGGGTG GGCCGTGTCG CCGTGGATGG 60 

CGCGCGCGCT GGTGTGTCTG GAGGAGGCCG GCGCCGACTA CGAGATCGTC CCCATGAGCA 120 

GGTGTGGCGG CGACCACCGC CGGCCGGAGC ACCTCGCCAA AAACCCGTTC GGTGAAATCC 180 

CAGTTTTAGA GGACGGTGAT CTCACGCTCT ACCAATCACG CGCCATCGCA CGGTACGTCC 240 

TCCGCAAGCT CAAGCCAGAG CTCCTCCGCG AAGGCGACCT CGAGGGGTCG GCGATGGTGG 300 

ACGCGTGGAT GGAGGTGGAA GCCCACCACA TGGAGCCGGC CCTGTGGCCC ATCATCCGCC 360 

ACAGCATCAT CGGCCAGTAC GTCGGCCGCG AGCGCGACCA CCAGGCCGTC ATCGACGAGA 420 

ACCTCGACAG GCTGAGGAAG GTGCTGCCGG CGTACGAGGC GAGGCTGTCC GTCTGCAAGT 480 

ACCTGGTGGG GGACGACATC AGCGCCGCCG ACCTCTGCCA CTTCGGCTTC ATGCGCTACT 54 0 

TCATGGCCAC GGAGTACGCC GGCTTGGTGG ACGCGTACCC GCACGTCAAG GCCTGGTGGG 600 

ACGCGCTGCT GGCGAGGCCC TCGGTGCAGA AGGTCATGGC AGGCATGCCG CCGGATTTTG 660 

GGTACGCCAG CGGGAACATA CCATAGGCTA GAAGCGGTGG GCGTCCGTCA TTCTGCAGAT 720 

CTGAGGTCTC TGAACCTCAG CGTTTCCGAT AAACATGCAT GCTTTATGTA CTGTTTAAAA 730 

AACAAACCTG ATTGGTGCAG GGTATTTTAG TCCTCTTAAA AAAAAAAAAA AAAAAAAAAA 840 
AAAA 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
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. C) STRAMDEDNES3 : not reieva.i- 
ID) TOPOLOGY: r.oc relevan?: 

MOLECULE TYPE: crocein 

!vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

ivii) IMMEDIATE SOURCE: 

(3) CLONE: BMSl . PK0023 . G3 

;xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Pro Pro Met Lys Val Tyr Gly Trp Ala Val Ser Pro Tr? Met 
; 5 10 15 

Ala Arg Ala Leu Val Cys Leu Giu Glu Ala Gly Ala Asp Tyr Glu He 
20 25 30 

Val Pro Met Ser Arg Cys Gly Gly Asp His Arg Arg Pro Glu His Leu 
35 40 45 

Ala Lys Asn Pro Phe Gly Glu He Pro Val Leu Glu Asp Gly Asp Leu 
50 55 60 

Thr Leu Tyr Gin Ser Arg Ala He Ala Arg Tyr Val Leu Arg Lys Leu 
65 70 75 80 

Lvs Pro Glu Leu Leu Arg Glu Gly Asp Leu Glu Gly Ser Ala Met Val 
85 90 95 

Asp Ala Trp Met Glu Val Glu Ala His His Met Glu Pro Ala Leu Trp 
100 105 110 

Pro He He Arg His Ser He He Gly Gin Tyr Val Gly Arg Glu Arg 
115 120 125 

Asp His Gin Ala Val He Asp Glu Asn Leu Asp Arg Leu Arg Lys Val 
130 135 140 

Leu Pro Ala Tyr Glu Ala Arg Leu Ser Val Cys Lys Tyr Leu Val Gly 
145 150 155 160 

Asp Aso He Ser Ala Ala Asp Leu Cys His Phe Gly Phe Met Arg Tyr 
165 170 175 

Phe Met Ala Thr Glu Tyr Ala Gly Leu Val Asp Ala Tyr Pro His Val 
180 185 190 

Lys Ala Trp Trp Asp Ala Leu Leu Ala Arg Pro Ser Val Gin Lys Val 
195 200 205 

Met Ala Gly Met Pro Pro Asp Phe Gly Tyr Ala Ser Gly Asn He Pro 
210 215 220 
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(2) IMrORMATION FOB SEQ ID MO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 
(3) TYPE: nucleic acid 

{C) STRAMDEDNES3: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) . IMMEDIATE SOURCE: 

(B) CLONE: CS.PK0010.C5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATCACCATTC TTCATCCCTC GTTGTCATCT CACAGCTTGG GCTAGAGACC AAACAAACCA 60 

AAGGGAAGCA TGGCAGCAGG CCTGCAAGTG TTTGGCCAGC CGGCGTCTAC TGATGTTGCC 120 

AGGGTTCTGA CGTGCCTGTT TGAGAAGAAA TTGGAGTTTG AGCTTGTCCG CATTGATACA 180 

TTTAAGACAC ATCACAGGCT TCCTGAGTTC ATCAGGCTGC GGGATCCGAA TGGGCAAGTG 240 

ACCTTCAAGC ATGGCGACAA AACCCTTGTT GATTCAAGGG ACATATGCCG GTACGTTTGC 300 

AACCAGTTTC CAAATTACGG AAACAAGAGC CTCTATGGAT CTGGTGCTCT AGAACGGGCA 360 

TCGATAGAAC AGTGGCTCCA GGCAGAAGCC CAGAACTTTG GCCCTCCCAG CTCTGCGCTT 420 

GTGTTTCAGC TGGCGTTCGT TCCGCACCTC AGTCACCTGG GCGTTCGTCA GGACCCTGCT 480 

GTTATTGCTG AAAACGAGGA CAAACTGAAG CAGGTTCTTG ATGTTTACGA CGAAATACTC 540 

TCCAAGAACG AGTACCTGGC TGGTGATGAG TTCACCCTGG CCGACCTGTC TCACCTTCCG 600 

AACTCGCACT ACATCGTAAA CACCGAGAGA GGAAGGAAGC TCTTCACCAA CAAGAAGAAT 660 

GTGGCGAAAT GGTATGACAG GCTCTCGAAG CGCGAGACAT GGGTGCAGGT CGTCAAGATG 720 

CAGAAGGAAC ATCCTGGTGC GTTCAAGTAA TGGCTTGTCT TGGGGAGTTG TGAGTATGGC 730 

TTCATCGTCC GTGTTGGTCT GGCTCATCAG TGTTAAAAGC CCATCAGTGT CGTCAACCAG 840 

AATAATGTGA AGCCCAACTG TGATGTATGG TCTTTTTTTT TTAAAAGCGC ATTTGTAAAC 900 

TATTGGCTAT TTCTTGCACG TGCCAATTCA TCGTCACATA TAAAATAAJ^C TGTATCTTTG 960 
ACCTTGTGTC ATGTACGCAA AAAAAAAAAA AAAAAAAAA 
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;2) IMrORt4ATION FOR SEQ ID NO: 4: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 
(3) TYPE: amino acid 
(C5 3TRANDEDNES3: not relevant 
■;0' TOPOLOGY: not relevanc 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(D TISSUE TYPE: MAIZE 

{vii) IMMEDIATE SOURCE: 

(A) LIBRARY: . CS . PKOOlO . C5 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Ala Ala Gly Leu Gin Val Phe Gly Gin Pro Ala Ser Thr Asp Val 
1 5 10 15 

Ala Arg Val Leu Thr Cys Leu Phe Glu Lys Lys Leu Glu Phe Glu Leu 
20 25 30 

Val Arg He Asp Thr Phe Lys Thr His His Arg Leu Pro Glu Phe He 
35 40 45 

Arg Leu Arg Asp Pro Asn Gly Gin Val Thr Phe Lys His Gly Asp Lys 
50 55 60 

Thr Leu Val Asp Ser Arg Asp He Cys Arg Tyr Val Cys Asn Gin Phe 
65 70 75 80 

Pro Asn Tyr Gly Asn Lys Ser Leu Tyr Gly Ser Gly Ala Leu Glu Arg 
85 90 95 

Ala Ser He Glu Gin Trp Leu Gin Ala Glu Ala Gin Asn Phe Gly Pro 
100 105 HO 

Pro Ser Ser Ala Leu Val Phe Gin Leu Ala Phe Val Pro His Leu Ser 
115 120 125 

His Leu Gly Val Arg Gin Asp Pro Ala Val He Ala Glu Asn Glu Asp 
130 135 140 

Lys Leu Lys Gin Val Leu Asp Val Tyr Asp Glu He Leu Ser Lys Asn 
145 150 155 160 

Glu Tyr Leu Ala Glv Asp Glu Phe Thr Leu Ala Asp Leu Ser His Leu 
165 170 175 

Pro Asn Ser His Tyr He Val Asn Thr Glu Arg Gly Arg Lys Leu Phe 
180 185 190 

Thr Asn Lys Lys Asn Val Ala Lys Trp Tyr Asp Arg Leu Ser Lys Arg 
195 200 205 
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Giu Th- T-o Val -l^lr; Vai Vai Lvs Met Gin Lv3 ^l*-: HLo Pro Giy Aia 
210 ■ 215 :20 

Phe Lys 
225 

(2) INFORMATION FOR SEQ ID MO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 
(3; TYPE: nucleic acid 

{Ci STRANDEDNESS: single 
(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CEBl . PK0017 . A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCCCAGCGGC GGCGAGGCGA TGGCGGCGCC TGTGACGGTG TACGGACCGA TGCTCTCACC 60 

AGCTGTGGCC CGCGTGGCGG CCTGCCTCCT GGAGAAGGAC GTGCCGTTCC AGATCGAGCC 120 

GGTGGACATG TCCAAGGGCG AGCACAAGTC GCCGTCCTTC CTCAAGCTCC AGCCCTTCGG 180 

ACAGGTCCCT GCCTTCAAGG ACCACCTCAC AACCGTCTTT GAGTCAAGGG CTATTTGCCG 240 

TTACATATGC GACCAGTATG CGGACTCTGG TAATCAGGCC CTCTTCGGCA AGAAAGAAGA 300 

CGGCGCGGTT GGCCGCGCTG CCATTGAACA GTGGATAGAG TCTGAAGGCC AGAGCTTTAA 360 

CCCACCGAGC TTGGCTATTA TCTTCCAGCT CGCATTTGCA CCGATGATGG GCCGGACCAC 420 

TGACCTGGCT GTGGTTGAGC AAATGAAGCG AAGCTTGCGA AGGTGCTTGA TGTGTATGAC 4 80 

CAACGGCTGG GGGAGAGCCA GTATTTTGCT GGTGATGATT TCTCCCCTGG CCGACCTTGT 540 

GCACTTGCCC AATGCAGATT TCCTTGTGAA CAGAACCAGC AAGGCTGGCT TGATCACCGA 600 

GAGAAAGAAT CTTGCTAGAT GGTGGGATGA TGTCTCGTCC CGACCTGCAT GGAAAAAGGT 660 

CACTGAGATG CAGAGCACGC CGAGGCCCTC TTAGAGCTTT TTTTTGGGTT TCTTTGAGCA 720 

GCTTCTGATG GCAATTAGTT GCATTCTCCT TGTTTTGTCA TCAAGTCCTT GTCTGTACCG 780 

TTTCCTGTTC TCTTATTTAT CGGTCTTAAT TCTTGATCTA TGTATGGTTT GGATCTGTTC 84 0 

TTCTGGTCCT TTAGTTTATA TAAGTACCTA CAATTCTTCA AAAAAAAAAA AAAAAAAAAA 900 
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(2) INFORMP.TIOM FOR 3EQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(3) CLONE: CEBl . PK0017 . A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ala Pro Val Thr Val Tyr Gly Pro Met Leu Ser Pro Ala Val 
1 5 10 15 

Ala Arg Val Ala Ala Cys Leu Leu Glu Lys Asp Val Pro Phe Gin He 
20 25 30 

Glu Pro Val Asp Met Ser Lys Gly Glu His Lys Ser Pro Ser Phe Leu 
35 40 45 

Lys Leu Gin Pro Phe Gly Gin Val Pro Ala Phe Lys Asp His Leu Thr 
50 55 60 

Thr Val Phe Glu Ser Arg Ala He Cys Arg Tyr He Cys Asp Gin Tyr 
65 70 75 80 

Ala Asp Ser Gly Asn Gin Ala Leu Phe Gly Lys Lys Glu Asp Gly Ala 
85 90 95 

Val Gly Arg Ala Ala He Glu Gin Trp He Glu Ser Glu Gly Gin Ser 
100 105 HO 

Phe Asn Pro Pro Ser Leu Ala He He Phe Gin Leu Ala Phe Ala Pro 
115 120 125 

Met Met Gly Arg Thr Thr Asp Leu Ala Val Val Glu Gin Asn Glu Ala 
130 135 140 

Lys Leu Ala Lys Val Leu Asp Val Tyr Asp Gin Arg Leu Gly Glu Ser 
145 150 155 160 

Gin Tyr Phe Ala Gly Asp Asp Phe Ser Pro Gly Arg Pro Cys Ala Leu 
165 170 175 

Ala Gin Cys Arg Phe Pro Cys Glu Gin Asn Gin Gin Gly Trp Leu Asp 
180 185 190 

His Arg Glu Lys Glu Ser Cys 
195 
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(2) IMrORMATIOM FOR 3ZQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 53 base pairs 
(3) TYPE: nucleic acid 

{C) 3TRANDEDNE33: Single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

{iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

ivi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CC71SE-A. PKOOOl .G2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGCGTCGGA GGAGCTCCAC GGCGTCAGGC CCTTCGACCC CGAGCGGACT CCGCTGCTGG 60 

CGGCGTGGTC GGAGCGCTTC GGCGCGCTGG ATGCCGTCCA GACGGTGATG CCCGACGTCG 120 

GCAGGCTGCT CGAGTTCGGC AAGGCGTTGA TGGCACGTCT GGCGGCTGCG GCCGCCGCCG 180 

GTGCAAGCAA TAACTGAAGA GGGCATGGTG TATCCGTCAT GTGTTTCAGG TTTTCGTATA 240 

GTGAACAAAA AAGGAAAAAA TAATGCTAGC TACGCATCGG AACGCGGCTT TGTGCTTTGC 300 

CGTCTCGCCG TTAGTTCAGC TTATGTGATG TGAGTGTTGC CGTGCATGTG TGTGTTACTT 360 

CAGATGTATC CTGTTCGGTT CAGTGATTAT ATGGAACATT TTATTTTGGT TGGATAAAAA 420 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CC71SE-A. PKOOOOl . G2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Ser Glu Glu Leu His Gly Val Arg Pro Phe Asp Pro Glu Arg Thr 
15 10 15 
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Pro Leu Leu Ala Aia Trp Ser GLi Arg Phe Giy Aia leu Asp Aia Vai 
20 25 30 

Gin Thr Vai Met Pro Aso Val Glv Arg Leu Leu Giu Phe Giy Lys Aia 
35 " 4: -35 

Leu Met Aia Arg Leu Aia Ala Ala Aia Ala Pro Vai 31r. Ala lie Thr 
50 55 60 

Giu Glu Giy Met Val Tyr Pro Ser Cys Vai Ser Giy Phe Arg lie Vai 
65 70 75 80 

Asn Lys Lys Giy Lys Asn Asn Ala Ser Tyr Ala Ser Giu Arg Giy Phe 
35 90 95 

Val Leu Cys Arg Leu Ala Val Ser Ser Aia Tyr Val Met 
100 105 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CC71SE-B . PK0014 . B8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCAAGGTCGA CATGTCGTCT CCGCCGCCGG TGAAGCTGAT CGGCTTCTTC GGCAGCCCGT 60 

ACGCGTTCCG CGCGGAGGCG GCGCTGTGCC TGAAAGGCGT GCCGTACGAG CTGATCCTGG 120 

AGGACCTGTT CGGCAGCAAG AGCGAGCTCC TGCTCCACCA CAACCCCGTG CACAAGAAGG 180 

TGCCCGTGCT CCTCCACGGC GACGGCCGGG CCATCTCCGA GTCCCTCGTC ATCGCCGAGT 240 

ACGTCGACGA GGCCTTCGAC GGGCCGCCGC TGCTCCCCGC CGACCCCTAC GCGCGCGCCG 300 

CCGCCCGCTT CTGGGCCGAC TTCATCGAGA CCAGGCTCAC CAAGCCCTTC TTCATGGCGA 360 

TCTGGGTGGA GGAGCGCGAC GCGCGGCTGC GGTTCGAGGA GGAGGCCAAG GAGCTCGTGG 420 

CGCTGCTGGA GGCGCAGCTC GAGGG/^AAGA GGTTCTTCGC CGGCGACAGG CCGGGGTACC 4 80 

TCGACGTGGC CGCGTCCGCG CTCGGGCCCT GGCGCAGCGT CATCGAGGAG CTCAACGGTG 54 0 
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TGGCGCTGCT CAGCGAGGAT GACCACCOCA ACCTGTGCCG GTGGACCAGG GACTACTGCG 600 

CCTTCGAGGC TCTCAAGCCG TGCATGCCGG ATCGGGAGAA GCTCCTCGCC TACTTCACTA 660 

AGAACTTCGA CAGGTACAAG GCGGCCGTCA ATGCGACGCT ATCGCAGTCG CAGCAGTAAT 720 

AACTGCCCAA CTGGGTACGC CTCTGCCCGG CCGTATGGCG GGCGTTTCTT TTTTTCTTTC 780 

TTCAGAATAA CGTAGCTGTG CCCAGTACTC ATGTTTTCAA TTCTGCAAJ^G TGCAAACCAA 84 C 

CAAGTCGCTG TGTGGTTTAC TCTT7TTAAA AAAAAAAAAA AAAAAAAJVAA AAAPAAAAA.-. 900 
AAAAAAAAAA A . 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CC7 ISE-B. PK0014 . B8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Ser Pro Pro Pro Val Lys Leu He Gly Phe Phe Gly Ser Pro 
15 10 15 

Tyr Ala Phe Arg Ala Glu Ala Ala Leu Cys Leu Lys Gly Val Pro Tyr 
20 25 30 

Glu Leu He Leu Glu Asp Leu Phe Gly Ser Lys Ser Glu Leu Leu Leu 
35 • 40 45 

His His Asn Pro Val His Lys Lys Val Pro Val Leu Leu His Gly Asp 
50 55 60 

Gly Arg Ala He Ser Glu Ser Leu Val He Ala Glu Tyr Val Asp Glu 
65 70 75 30 

Ala Phe Asp Gly Pro Pro Leu Leu Pro Ala Asp Pro Tyr Ala Arg Ala 
85 90 95 

Ala Ala Arg Phe Trp Ala Asp Phe He Glu Thr Arg Leu Thr Lys Pro 
100 105 110 

Phe Phe Met Ala He Trp Val Glu Glu Arg Asp Ala Arg Leu Arg Phe 
115 120 125 
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Giu Giu Giu Aia Lys Giu Leu Vai Ala Leu Leu Giu Ala Gin Leu Giu 
130 135 140 

Giy Lys Arg Phe Phe Aia Giy Asp Arg Pro Gly Tyr Leu Asp Vai Ala 
145 150 155 160 

Aia Ser Ala Leu Giy Pro Trp Arg Ser Vai He Giu Giu Leu Asn Giy 
165 170 175 

Vai Aia Leu Leu Ser Giu Asp Asp His Pro Asn Leu Cys Arg Trp Thr 
180 185 190 

Arg Aso Tyr Cys Ala Phe Giu Aia Leu Lys Pro Cys Met Pro Asp Arg 
195 200 205 

Giu Lys Leu Leu Ala Tyr Phe Thr Lys Asn Phe Asp Arg Tyr Lys Ala 
210 215 220 

Ala Val Asn Ala Thr Leu Ser Gin Ser Gin Gin 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: maize 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: ceb5 . pk0051 . f 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGCGCATGCA GGTAGCAATG GCGGGGGAGA CGAAGAAGGG CCTGGTGCTG CTGGACTTCT 60 

GGGTGAGCCC GTTCGGGCAG CGCTGCCGCA TCGCGCTGGC GGAGAAGGGC ATCGCCTACG 120 

AGTACTCGGA GCAGGAGCTG CTGGGCGGCG CCAAGAGCGA CATCCTCCTC CGCTCCAACC 180 

CGGTGCACAA GAAGATCCCC GTGCTCCTCC ACGACGGCCG CCCCGTCTGC GAGTCCCTCG 240 

TCATCCTCGA GTACCTCGAG GAGGCCTTCC CGGAGGCCTC CCCCAGGCTG CTCCCCGACG 300 

CCGCCTACGC GCGCGCGCAG GCCCGCTTCT GGGCGGCCTA CTCCGACAAG GTCTACAAGG 360 

CCGGCACGCG GCTGTGGAAG CTCAGGGGCG ACGCGCGGGC GCAGGCGCGC GCCGAGATCG 420 

TGCAGGTGGT CCGGAACCTC GACGGCGAGC TAGGGGACAA GGCCTTCTTC GGCGGCGAGG 4 80 
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CGTTCGGGTT CGTGGACGTG GCGCTCGTGC CCTTCGTGCC GTGGCTCCCC AGCTACGAGC 540 

GGTACGGGGA CTTCAGCGTG GCGGAGATCG CGCCCAGGCT GGCGGCGTGG GCGCGCCGGT 600 

GCGCGCAGCG GGAGAGCGTG GCCAGGACCC TTCACCCGCC GGAPAAGG7G GACGAGTTCA 660 

TCAACCTGCT CAAGAAGACC TACGGCATCG AGTAGTAGAG CGGACTACTA CTAGCAGAGG 720 

AGATGGTACC GGCCGTACGT ACGTGGCTGC CATGCAGTTT TTGTTTCGGT TTGTTTAAAC 780 

GGGACTCCAT GAATGGATGG AACTCTTCTT GGGCTCCGTG TGCTACATAC ACATCTGTAA 840 

AGGTGAACTA AAATCACGGT AAAAACTCGG AAATTAGTTT GTAAAGGGTC CAGCCCCCCT 900 

CCTTTATAAA TAGAGAGGTA TACGGCTGAT AAAAAAAAAA AAAAAAAA 948 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: maize 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: ceb5 .pk0051 . f 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Gly Glu Thr Lys Lys Gly Leu Val Leu Leu Asp Phe Trp Val 
15 10 15 

Ser Pro Phe Gly Gin Arg Cys Arg He Ala Leu Ala Glu Lys Gly He 
20 25 30 

Ala Tyr Glu Tyr Ser Glu Gin Glu- Leu Leu Gly Gly Ala Lys Ser Asp 
35 40 45 

He Leu Leu Arg Ser Asn Pro Val His Lys Lys lie Pro Val Leu Leu 
50 55 60 

His Asp Gly Arg Pro Val Cys Glu Ser Leu Val He Leu Glu Tyr Leu 
65 70 75 80 

Glu Glu Ala Phe Pro Glu Ala Ser Pro Arg Leu Leu Pro Asp Ala Ala 
85 90 95 

Tyr Ala Arg Ala Gin Ala Arg Phe Trp Ala Ala Tyr Ser Asp Lys Val 
100 105 HO 

Tyr Lys Ala Gly Thr Arg Leu Trp Lys Leu Arg Gly Asp Ala Arg Ala 
115 120 125 
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Gin Ala Arg Aia Glu He Val Gin Val Val Arg Asn Leu Asp Giy Glu 
130 135 140 

Leu Gly Asp Lys Ala Phe ?he Gly Gly Glu Ala Phe Gly Phe Val Asp 
145 150 155 160 

Val Ala Leu Val Pro Phe Val Pro Trp Leu Pro Ser Tyr Glu Arg Tyr 
165 170 173 

Gly Aso Phe Ser Val Ala Glu He Ala Pro Arg Leu Ala Ala Trp Ala 
180 185 190 

Arg Arg Cys Ala Gin Arg Glu Ser Val Ala Arg Thr Leu His Pro Pro 
195 200 205 

Glu Lys Val Asp Glu Phe He Asn Leu Leu Lys Lys Thr Tyr Gly He 
210 215 220 

Glu 
225 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CRIN. PK0003. 81 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTTGGGGATG TGGGCGAGCC CTATGGTGAT CAGGGTGGAG TGGGCGCTGC GGCTGAAGGG 60 

CGTCGAGTAC GAGTACGTCG ACGAGGACCT CGCCAACAAG AGCGCCGACC TGCTCCGCCA 120 

CAACCCGGTG ACCAAGAAGG TGCCCGTGCT CGTCCACGAC GGCAAGCCGG TCGCGGAGTC 180 

CACCATCATC GTCGAGTACA TCGACGAGGT CTGGAAGGGC GGCTACCCCA TCATGCCGGG 240 

CGACCCCTAC GAGCGTGCCC AGGCAAGGTT CTGGGCCAGG TTCGCTGAAG ACAAGTGCAA 300 

CGCTGCTCTG TACCCGATCT TCACCGCGAC CGGCGAGGCG CAGCGCAAGG CGGTGCACGA 360 

GGCCCAGCAG TGCCTCAAGA CCCTGGAGAC GGCCTTGGAC GGGAAGAAGT TCTTCGGCGG 420 
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GGACGCCGTG 


GGCTACCTCG 


ACATCGTCGT 


CGGGTGGTTC 


GCGCACTGGC 


TCCCCGTCAT 


4 30 


CGAGGAGGTG 


ACCGGCGCCA 


GCGTCGTCAC 


CGACGAGGAG 


CTGCCGCTGA 


TGAA.GGCCTG 


540 


GTTCGGCCGG 


TTCCTCGCCG 


TTGACGTGGT 


GAAGGCGGCC 


CTGCCCGACA 


GGGACAGGCT 


600 


CCTCGCCGCC 


AACAAGGCCC 


GCCGTGAGCA 


GCTCCTCTCC 


GCGTAGATGG 


CTAGTAATTC 


660 


TGGAGCAGCT 


AGTTTCACCG 


CCGACGCTCA 


TATATTGCTG 


AATAAGGACT 


GGTTGCACTT 


720 


TTGCACGTTG 


TGCAGTGCAG 


CCGAGGTTTG 


GATGACCTCT 


GCCCCTCTGT 


TCCATTTCAG 


780 


AATGGTAGTC 


CCATAATAAT 


GCATATACAT 


CATGCATAAA 


AAAAAAAJ^AA 


AAAAAAAAAA 


840 


(2) 


INFORMATION 


FOR SEQ ID 


NO: 14: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CRIN. PK0003 .Bl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Trp Ala Ser Pro Met Val He Arg Val Glu Trp Ala Leu Arg Leu 
1 5 10 -15 

Lys Gly Val Glu Tyr Glu Tyr Val Asp Glu Asp Leu Ala Asn Lys Ser 
20 25 30 

Ala Asp Leu Leu Arg His Asn Pro Val Thr Lys Lys Val Pro Val Leu 
35 40 45 

Val His Asp Gly Lys Pro Val Ala Glu Ser Thr He He Val Glu Tyr 
50 55 60 

He Asp Glu Val Trp Lys Gly Gly Tyr Pro He Met Pro Gly Asp Pro 
65 70 75 80 

Tyr Glu Arg Ala Gin Ala Arg Phe Trp Ala Arg Phe Ala Glu Asp Lys 
85 90 95 

Cys Asn Ala Ala Leu Tyr Pro He Phe Thr Ala Thr Gly Glu Ala Gin 
100 105 110 

Arg Lys Ala Val His Glu Ala Gin Gin Cys Leu Lys Thr Leu Glu Thr 
115 120 125 

Ala Leu Asp Gly Lys Lys Phe Phe Gly Gly Asp Ala Val Gly Tyr Leu 
130 135 140 
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Asp lie Val Vai Gly Trp Phe Aia His Trp Leu Pro Vai lie GIu Glu 
145 150 155 

Val Thr Gly Ala Ser Val Val Thr Asp Glu Glu Leu Pro Leu Met Lys 
165 170 1*^5 

Ala Tro Phe Gly Arg Phe Leu Ala Val Asp Val Val Lys Ala Ala Leu 
180 185 190 

Pro Asp Arg Asp Arg Leu Leu Ala Ala Asn Lys Ala Arg Arg Glu Gin 
195 200 205 

Leu Leu Ser Ala 
210 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ' cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CRIN. PK0014 .G8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGAGGCGCA GAGCTTCGAC GCGCCCAGCG CCGAGATGGT CTACAGCCTC GCCTTCCTGC 60 

CGCCCACCCT GCCCAAGCAG AACGACAACG GCAACGGCGG CGCGTTCAAC GCCAGGGACG 120 

CCACCGTAGG CAGCAACGCC GACGCGTCCA GCGGCAAGCG CGGTGTGGCC GGGTCACAGC 180 

CGGCGGCGAG CCAGACCAAG GTGAGCGCGC AGAAGGAGGA GGAGATGCTG AAGCTGTTCG 240 

AGCAGAGGAA GAAGGACCTG GAGAAGCTGC TGGACATCTA CGAGCAGCGC CTGGAGGAGG 300 

CCACGTTCCT GGCCGGCGAC AACTTCACCA TCGCCGACCT GTCGCACCTG CCCTACGCGG 360 

ACCACCTCGT CTCCGACCCG CGCTCCCGCC GCATGTTCGA GTCCCGCAAG AACGTCAGCA 420 

GGTGGTGGCA CGACGTCTCC GGCCGCGACA CCTGGAAGTA CGTCAAGACC CTGCAGCGCC 480 

CGCCGTCCAC GTCCACCGAC GCCAGCGCCA AGAACGGCCA GCTGGGCCAG CAGCAGCACC 540 

TGCCGTCGTC CACCGACGGC CACGGCGTGA AGACCCAACG GCTGGTCCAG AACGAGCGGC 600 
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ACTTCTAGCT GTTGCCGTCC CTTCCCGCCG ACGAATAAAC TACCTGCGCC GCC.GCCACCG b60 

CCGCCATCCA TCAACATGGT TCCTTGTGCT GTTCGTGTCG TTTTCATACG TCATACGTGT 720 

CTTGCTGCTT TTGAAGCTCC GTTCCCGGGT GCAGGGACCT ACGAGTCCAT TCCGTCGTTT '780 

GCTGATTCTG TTCGTCGTGt AATAAAATGA P-AACCCCACC CCGTTTT3AA TGAAAJ^AAAA 840 
AAAAAAAAAA AAAAAAAAAA A 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: maize 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: crln.pk0014 .g8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Val Tyr Ser Leu Ala Phe Leu Pro Pro Thr Leu Pro Lys Gin Asn 
15 10 15 

Asp Asn Gly Asn Gly Gly Ala Phe Asn Ala Arg Asp Ala Thr Val Gly 
20 25 30 

Ser Asn Ala Asp Ala Ser Ser Gly Lys Arg Gly Val Ala Gly Ser Gin 
35 40 45 

Pro Ala Ala Ser Gin Thr Lys Val Ser Ala Gin Lys Glu Glu Glu Met 
50 55 60 

Leu Lys Leu Phe. Glu Gin Arg Lys Lys Asp Leu Glu Lys Leu Leu Asp 
65 70 75 80 

He Tyr Glu Gin Arg Leu Glu Glu Ala Thr Phe Leu Ala Gly Asp Asn 
85 90 95 

Phe Thr He Ala Asp Leu Ser His Leu Pro Tyr Ala Asp His Leu Val 
100 105 110 

Ser Asp Pro Arg Ser Arg Arg Met Phe Glu Ser Arg Lys Asn Val Ser 
115 120 125 

Arg Trp Trp His Asp Val Ser Gly Arg Asp Thr Trp Lys Tyr Val Lys 
130 135 140 

Thr Leu Gin Arg Pro Pro Ser Thr Ser Thr Asp Ala Ser Ala Lys Asn 
145 150 155 160 
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Gly Gin Leu Giv Gin Gin Gin His Leu Pro Ser 5er Thr Asp Gly His 
165 170 l''^ 

Giy Vai Lvs Thr Gin Arg Leu Vai Gin Asn Giu Arg His Phe 
180 135 190 

(2) INFORMATION FOR 3EQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91? base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: M. 15 . 5 . D06. SK20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATGGCGGAGG TGGAGGCGAC GGTGGGGCGA CTGATGCTGT ACTCGTACTG GCGCAGCTCG 60 

TGCTCCCACC GTGCCCGCAT CGCTCTCAAT CTCAAAGGTG TGGATTACGA GTACAAGGCG 120 

GTGAACCTTC TCAAGGGCGA GCAGTCTGAT CCAGAATTCG TCAAGCTTAA TCCTATGAAG 180 

TTCGTCCCTG CGTTGGTTGA TGGCAGTTCT GTAATAGGTG ACTCTTACGC GATAACACTG 240 

TATTTGGAGG ACAA6TACCC AGAGCCTCCT CTTCTACCTC AAGACCTTCA AAAGAAAGCT 300 

TTGAATCACC AGATTGCAAG CATTGTAGCT TCTGGTATTC AACCTCTCCA TAACCTCACA 360 

GTGTTGAGGT TCATTGACCA GAAGGTTGGT GCAGGGGAGA GTGTGTTGTG GACTCAACAA 420 

CAAATCGAGA GAGGTTTCAC AGCTATTGAG AACCTGATAC AACTAAAAGG ATGCGCCGGG 480 

AAGTATGCAA CAGGAGATGA AGTCCAACTG GCAGATGTAT TCCTTGCACC CCAGATCTAT 540 

GCAGCCATTG AACGCACTAA AATTGACATG TCAAACTACC TCACTCTTGC TAGGCTCCAC 600 

TCGGAGTACA TGTCACACCC TGCGTTTGAA GCAGCGCTCC CTGGCAAGCA ACCGGACGCC 660 

CCTTCATCCT CCTAGGAACT GCACCCTAGT GTGTTGTTCC TCTGAATATA TATATATATA 720 

TATGTATACT TCTGTAAGAA TTAATAATTA CAGAGTTTCG TCTGCTATGT CGAAAAATGT 780 

CAAAAGTTTT TGTGATTTCA GAGACTAGCG GCATGAAGCG TCGTTGTGGA TCTGGCCGTC 840 

GTCCTCATGT GGCATCTGTG ATTTCAGGGC ATGCACTTCG TCTTAGAAGG GAAAAAAAAA 900 
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^■^17 



AAAAPJWVAA AAAAAAA 

(2) INFORMATION FOR 3EQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 
(31 TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: M. 15 . 5 - D06. SK20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ala Glu Val Glu Ala Thr Val Gly Arg Leu Met Leu Tyr Ser Tyr 
1 5 10 15 

Trp Arg Ser Ser Cys Ser His Arg Ala Arg He Ala Leu Asn Leu Lys 
20 25 30 

Gly Val Asp Tyr Glu Tyr Lys Ala Val Asn Leu Leu Lys Gly Glu Gin 
35 40 45 

Ser Asp Pro Glu Phe Val Lys Leu Asn Pro Met Lys Phe Val Pro Ala 
50 55 60 

Leu Val Asp Gly Ser Ser Val He Gly Asp Ser Tyr Ala He Thr Leu 
65 70 75 80 

Tyr Leu Glu Asp Lys Tyr Pro Glu Pro Pro Leu Leu Pro Gin Asp Leu 
85 90 95 

Gin Lys Lys Ala Leu Asn His Gin He Ala Ser He Val Ala Ser Gly 
100 105 110 

He Gin Pro Leu His Asn Leu Thr Val Leu Arg Phe He Asp Gin Lys 
115 120 125 

Val Gly Ala Gly Glu Ser Val Leu Trp Thr Gin Gin Gin He Glu Arg 
130 - 135 140 

Gly Phe Thr Ala He Glu Asn Leu He Gin Leu Lys Gly Cys Ala Gly 
145 150 155 160 

Lys Tyr Ala Thr Gly Asp Glu Val Gin Leu Ala Asp Val Phe Leu Ala 
165 170 175 

Pro Gin He Tyr Ala Ala He Glu Arg Thr Lys He Asp Met Ser Asn 
180 185 190 

Tyr Leu Thr Leu Ala Arg Leu His Ser Glu Tyr Met Ser His Pro Ala 
195 200 205 
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Phe Glu Ala Ala Leu Pro Gly Lys Gin Pro Asp Ala Pro Ser 3er Ser 
210 215 220 

(2) IMFOE^TION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CRIN. PK0040 • E12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CACCTGCTGT ATCTCATTAC CATCTGCATC TGGTTGCCCG TTGATTGAGA AGGAGGAGCT 60 

GAGGGCCATG GCGACCGAGA AGCCCATCCT GTACAACGCC TGGATCAGCT CCTGCTCCCA 120 

CCGTGTTCGC ATCGCACTCA ACCTCAAAGG TGTGGATTAC GAGTACAAGT CGGTAAACCC 180 

TAGGACAGAT CCAGATTATG AAAAAATCAA TCCAATCAAA TATATTCCAG CATTAGTAGA 240 

TGGGGACATA GTCGTTTCTG ATTCTCTTGC CATCTCATTG TATTTGGAAG ATAAGTATCC 300 

TGAGCATCCA CTCCTGCCTA AAGATCTCAA GAGGAAAGCT CTTAATCTTC AGATTGCAAA 360 

CATTGTTTGT TCAAGCATTC AACCTCTTCA AGGCTATGCT GTTATTGGTC TGCACGAGGG 420 

TAGGATGAGC CCAGATGAGG GCCTTCATAT TGTTCAAAGT TATATTGACA AGGGATTCAG 480 

AGCGATCGAA AAGCTGTTGG AAGGATGTGA GAGTAAATAT GCTACTGGAG ATGATGTCCA 540 

ATTGGCAGAT GTGTTCCTTG AACCACAGAT ACATGCCGGC ATAAATCGCT TCCAAATCGA 600 

TATGTCGATG TACCCAATCT TGGAGAGGCT CCATGATGCA TACATGCAAA TTCCCGCATT 660 

CCAAGCCGCG CTTCCTAAAA ATCAACCAGA CGCACCTTCA TCATAATCAT CAAGATTATC 720 

TCAATAATTT GCATGTCATT TTGTAATAAT TTGGATAGGG AGCCACTGCT TCCTCCATCC 780 

CGTTGTGGAT CAAAAGGGTG AACGATTGGC ACTTACCTGC ATGGTCCAAT ACCTATTATA 840 

TTTCTTAAAC AGATACTATT TACGGCTATT GTAATTTAAG CCCAAAAAAA AAAAAAAAAA 900 

AAAAAAAAAA AAAAAAAAA 
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(2) INFORMATION FOR 3EQ ID NO: 20: 

(i) sequence: CHARACTERISTICS: 

{A; LENGTH: 212 amino acids 
(3} TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CRIN . PK0040 . E12 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 

Met Ala Thr Glu Lys Pro He Leu Tyr Asn Ala Trp He Ser Ser Cys 
15 10 15 

Ser His Arg Val Arg He Ala Leu Asn Leu Lys Gly Val Asp Tyr Glu 
20 25 30 

Tyr Lys Ser Val Asn Pro Arg Thr Asp Pro Asp Tyr Glu Lys He Asn 
35 40 45 

Pro He Lys Tyr He Pro Ala Leu Val Asp Gly Asp He Val Val Ser 
50 55 60 

Asp Ser Leu Ala He Ser Leu Tyr Leu Glu Asp Lys Tyr Pro Glu His 
65 70 75 80 

Pro Leu Leu Pro Lys Asp Leu Lys Arg Lys Ala Leu Asn Leu Gin He 
85 90 95 

Ala Asn He Val Cys Ser Ser He Gin Pro Leu Gin Gly Tyr Ala Val 
100 105 110 

He Gly Leu His Glu Gly Arg Met Ser Pro Asp Glu Gly Leu His He 
115 120 125 

Val Gin Ser Tyr He Asp Lys Gly Phe Arg Ala He Glu Lys Leu Leu 
130 135 140 

Glu Gly Cys Glu Ser Lys Tyr Ala Thr Gly Asp Asp Val Gin Leu Ala 
145 150 155 160 

Asp Val Phe Leu Glu Pro Gin He His Ala Gly He Asn Arg Phe Gin 
165 170 175 

He Asp Met Ser Met Tyr Pro He Leu Glu Arg Leu His Asp Ala Tyr 
180 185 190 

Met Gin He Pro Ala Phe Gin Ala Ala Leu Pro Lys Asn Gin Pro Asp 
195 200 205 
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Ala Pro Ser Ser 
210 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: " 996 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CEB5 . PK004 9 . All 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CATCGATCCG CCATTGCTCA CCGCACAAGT GCACGCTCAC CTCACACACG CAGCTAAGTA 60 

GCTAACGCCG TAGGCGAGAA CAAGAAAAGG CTCGACATGG CCGAGGAGAA GAAGCAGGGC 120 

CTGCAGCTGC TGGACTTCTG GGTGAGCCCA TTCGGGCAGC GCTGCCGCAT CGCGCTGGAC 180 

GAGAAGGGCC TGGCCTACGA GTACCTGGAG CAGGACCTGA GGAACAAGAG CGAGCTGCTC 240 

CTCCGCGCCA ACCCGGTGCA CAAGAAGATC CCCGTGCTGC TGCACGACGG CCGCCCCGTC 300 

TGCGAGTCCC TCGTCATCGT GCAGTACCTC GACGAGGCGT TCCCGGAGGC GGCGCCGGCG 360 

CTGCTCCCCG CCGACCCCTA CGCGCGCGCG CAGGCCCGCT TCTGGGCGGA CTACGTCGAC 420 

AAGAAGCTGT ACGACTGCGG CACCCGGCTG TGGAAGCTCA AGGGGGACGG CCAGGCGCAG 4B0 

GCGCGCGCCG AGATGGTCGA GATCCTCCGC ACGCTGGAGG GCGCGCTCGG CGACGGGCCC 540 

TTCTTCGGTG GCGACGCCCT CGGCTTCGTC GACGTCGCGC TCGTGCCCTT CACGTCCTGG 600 

TTCCTCGCCT ACGACCGCTT CGGCGGCGTC AGCGTGGAGA AGGAGTGCCC GAGGCTGGCC 660 

GCCTGGGCCA AGCGCTGCGC CGAGCGCCCC AGCGTCGCCA AGAACCTCTA CCCGCCCGAG 720 

AAGGTCTACG ACTTCGTCTG CGGGATGAAG AAGAGGCTGG GCATCGAGTA GAGCATCCAT 780 

CGGTCGGCCG GTGGCTGGCC GGGAGTAATA ATGACGAACC AATAATCTAG TTTTGGTTTT 840 

AGTGTGCTCA GCAGAGCAGT TCGTGTTCAT GAGTTCGTCG TCGTTGTATT TTCTATTGTC 900 

AGCGGTGGCA GCGCCGTACG TGTTGCCTCG TACACCACAA CCGAATAAAT GGTTATGAAT 960 
TTCTTCTTGT TGTCTTAAAA AAAAAAAAAA AAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CEB5 . PK004 9 . All 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Ala Glu Glu Lys Lys Gin Gly Leu Gin Leu Leu Asp Phe Trp Val 
15 10 15 

Ser Pro Phe Gly Gin Arg Cys Arg He Ala Leu Asp Glu Lys Gly Leu 
20 25 30 

Ala Tyr Glu Tyr Leu Glu Gin Asp Leu Arg Asn Lys Ser Glu Leu Leu 
35 40 45 

Leu Arg Ala Asn Pro Val His Lys Lys He Pro Val Leu Leu His Asp 
50 55 60 

Gly Arg Pro Val Cys Glu Ser Leu Val He Val Gin Tyr Leu Asp Glu 
65 70 75 80 

Ala Phe Pro Glu Ala Ala Pro Ala Leu Leu Pro Ala Asp Pro Tyr Ala 
85 90 95 

Arg Ala Gin Ala Arg Phe Trp Ala Asp Tyr Val Asp Lys Lys Leu Tyr 
100 105 110 

Asp Cys Gly Thr Arg Leu Trp Lys Leu Lys Gly Asp Gly Gin Ala Gin 
115 120 125 

Ala Arg Ala Glu Met Val Glu He Leu Arg Thr Leu Glu Gly Ala Leu 
130 135 140 

Gly Asp Gly Pro Phe Phe Gly Gly Asp Ala Leu Gly Phe Val Asp Val 
145 150 155 160 

Ala Leu Val Pro Phe Thr Ser Trp Phe Leu Ala Tyr Asp Arg Phe Gly 
165 170 175 

Gly Val Ser Val Glu Lys Glu Cys Pro Arg Leu Ala Ala Trp Ala Lys 
ISO 185 190 

Arg Cys Ala Glu Arg Pro Ser Val Ala Lys Asn Leu Tyr Pro Pro Glu 
195 200 205 
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Lys Val Tyr Asp Phe Val Cys Gly Mec Lys Lys Arg Leu Gly lie Giu 
210 215 220 

(2) INFORMATION FOR 3EQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MAIZE 

(vii) IMMEDIATE SOURCE: 

( B ) CLONE : CS 1 • PKO 0 5 9 . E2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGCACGAGAC GACATCGAAG GAGCCTGCGA AGCGAGCGAG AGTCTATAAT GGCGGACGGA 60 

GGCGAGCTGC AGCTGCTGGG CTCATGGTAC AGCCCCTACG TGATCCGCGC CAAGGTGGCG 120 

CTGGGGCTGA AGGGGCTCAG CTACGAGTTC GTCGAGGAGG ACCTCTCCCG CAAGAGCGAC 180 

CTGCTGCTGA AGCTCAACCC GGTGCACAGG AAGGTGCCCG TGCTGGTCCA CGGCGGCCGC 240 

CCCGTGTGCG AGTCGCTCGT CATCCTGCAG TACGTCGACG AGACCTGGGC AGGCACCGGG 300 

ACCCCTCTCC TCCCCGCCGA CGCCTACGAC CGCGCCATGG CTCGCTTCTG GGCAGCCTAC 360 

GTCGACGACA AGTTCTACAA GGAGTGGAAC CGGCTGTTCT GGTCGACGAC GGCGG AGAAG 420 

GCGGCGGAGG CGCTCGGCGT CGTCGTCCCC GTGGTGGAGA CGCTGGAGCA GGCGTTCAGG 480 

GAGTGCTCCA AAGGGAAACC TTCTTCGGCG GCGACGCCGT CGGGCTCGTG GACATCGCGC 540 

TCGGGAGCTT CGTGGTGTGG ATCAGGGTGG TGGACGAGGC GGCCGGCGTA AAGCTTCTGG 600 

ACGAGGCCAA GTTCCCGGCC TTGACGGCGT GGGCGGAGCG CTTCTTGGCG GTGGACGCCG 660 

TGAAGGAGGT GATGCCGGAC GCCGGAAGGC TGTTGGAGCA CTACAAGGGG TTTCTGGCTA 720 

AACGGTCTCC ACCTGCTGGT TACTGAACGC TGTAACTGTA AGCCTGTAAC AGCAAGCTCA 780 

GTGTTCGTGT ACTTTTCCGT GCGTTAACGT GTACTAGAGT TCAGGAAAGG CTTTGATTCT 840 

GTCCAGAGTC CAGACGAATA AACGAATGTT TTTTATAAAA AAAAAAAAAA AAAAA 895 
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(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not reievan-: 

{ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: MJVI2E 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: CSl . PK0059 . £2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ala Asp Gly Gly Glu Leu Gin Leu Leu Gly Ser Trp Tyr Ser Pro 
15 10 15 

Tyr Val He Arg Ala Lys Val Ala Leu Gly Leu Lys Gly Leu Ser Tyr 
20 25 30 

Glu Phe Val Glu Glu Asp Leu Ser Arg Lys Ser Asp Leu Leu Leu Lys 
35 40 .45 

Leu Asn Pro Val His Arg Lys Val Pro Val Leu Val His Gly Gly Arg 
50 55 60 

Pro Val Cys Glu Ser Leu Val He Leu Gin Tyr Val Asp Glu Thr Trp 
65 70 75 80 

Ala Gly Thr Gly Thr Pro Leu Leu Pro Ala Asp Ala Tyr Asp Arg Ala 
85 90 95 

Met Ala Arg Phe Trp Ala Ala Tyr Val Asp Asp Lys Phe Tyr Lys Glu 
100 105 110 

Trp Asn Arg Leu Phe Trp Ser Thr Thr Ala Glu Lys Ala Ala Glu Ala 
115 120 125 

Leu Gly Val Val Val Pro Val Val Glu Thr Leu Glu Gin Ala Phe Arg 
130 135 140 

Glu Cys Ser Lys Gly Lys Pro Ser Ser Ala Ala Thr Pro Ser Gly Ser 
145 150 155 160 

Trp Thr Ser Arg Ser Gly Ala Ser Trp Cys Gly Ser Gly Trp Trp Thr 
165 170 175 

Arg Arg Pro Ala 
180 
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The isolation and recoirtDinant expression of maize-specific 
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to screen for potential substrates and inhibitors of said 
GST enzymes. 
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